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ABSTRACT 

The strong deviation in the properties of X-ray clusters from simple scaling laws highlights 
the importance of non-gravitational heating and cooling processes in the evolution of proto- 
cluster gas. We investigate this from two directions: by finding the amount of 'excess energy' 
required in intra-cluster gas in order to reproduce the observed X-ray cluster properties, and 
by studying the excess energies obtained from supernovae in a semi-analytic model of galaxy 
formation. Using the insights obtained from the model, we then critically discuss possible 
ways of achieving the high excess specific energies required in clusters. These include heating 
by supernovae and active galactic nuclei, the role of entropy, and the effect of removing gas 
through radiative cooling. 

Our model self-consistently follows the production of excess energy and its effect on gas 
halos. Excess energy is retained in the gas as gravitational, kinetic and/or thermal energy. The 
density profile of a gas halo is then selected according to the total energy of the gas. Our 
principle assumption is that in the absence of non-gravitational processes, the total energy 
of the gas scales as the gravitational energy of the virialized halo — a self-similar scaling law 
motivated by hydrodynamic simulations. This relation is normalized by matching the model 
to the largest observed clusters. 

We model the gas distributions in halos by using a 2-parameter family of gas profiles. In 
order to study the sensitivity of results to the model, we investigate four contrasting ways of 
modifying gas profiles in the presence of excess energy. In addition, we estimate the minimum 
excess energy required in a fiducial cluster of around 2 keV in temperature by considering all 
available gas profiles. We conclude that the excess energies required lie roughly in the range 
1-3 keV/particle. 

The observed metallicities of cluster gas suggests that it may be possible for supernovae 
to provide all of the required excess energy. However, we argue that this scenario is only 
marginally acceptable and would lead to highly contrived models of galaxy formation. On the 
other hand, more than enough energy may be available from active galactic nuclei. 

Key words: galaxies: clusters: general - galaxies: formation - galaxies: evolution - cooling 
flows - X-rays: galaxies 



1 INTRODUCTION 



Much progress has been made in recent years in the modelling of 
galaxy formation, partly in response to an unprecedented amount of 
new data, especially for galaxies at high redshift. This paper how- 
ever aims to constrain the model from the high-mass end, by tack- 
ling the properties of X-ray clusters. This has the advantage that the 
results are insensitive to the detailed physics of star formation and 
feedback. Only a small fraction of the hot gas in clusters is able to 
cool in a Hubble time, so that any star formation has little effect on 
the structure of the gas halo. Since star formation and feedback are 



two of the least understood components of galaxy formation, this 
seems to be a natural approach to take. 

On the other hand. X-ray clusters do contain a fossil record 
of the complex star-formation history of their progenitors. The 
amount of gas left in a cluster's halo depends on the amount con- 
sumed in processes such as star formation. The heavy elements (or 
metals) observed in the gas are the result of enrichment by super- 
novae over billions of years. Like the metals, the energy injected 
into the gas by supernovae and active galactic nuclei (AGN) is re- 
tained in the gas if it is not radiated. We shall be particularly inter- 
ested in this 'excess energy' that is retained in present day clusters. 
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X-ray clusters therefore provide important constraints on the his- 
tory of a large sample of baryons. 

Broadly speaking, a complex physical system can be studied 
via numerical methods, e.g. N-body simulations, or via analytic cal- 
culations. In galaxy formation theory, the semi-analytic approach 
has come to refer to more than just an intermediate line of attack, 
but to a specific class of models that use the hierarchical merger 
tree as their starting poin t. In the Cold Dark Matter (CDM) model 
(Blumenthal et al. 1984), small halos virialize first and progres- 



sively collapse into larger and larger halos. The merger tree follows 
the masses of these halos as a function of time. The evolution of 
the baryonic component in these halos — which comprises 1/10 
of the total mass — receives a simplified yet physical treatment that 
models processes such as cooling, star formation and supernova 
feedback, to name a few. 

Although N-body simulations of dark matter (DM) clustering 
now provide perhaps the best understood piece in the jigsaw of 
how galaxies formed, the evolution of the baryonic component re- 
mains much less well understood. In both hydrodynamic-l-DM sim- 
ulations and semi-analytic models (SAMs), many of the above gas 
processes need to be approximated by simple rules. Nevertheless, 
using SAMs, we are able to efficiently explore the unknown pa- 
rameters in these processes, and study the range of behaviour in 



in modelling many properties of galaxies (jWhite & Frenk 1991; 


Kauffmann, White, & Guiderdoni 1993|: 


::ole et al. 1994|; Kauff- 


mann & Chariot 1998; |Baugh et al. 199S 
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1998; puiderdoni et al. 199^ 

In this paper, we investigate the effect of excess energy on 
the density profiles of gas halos, and thus on the properties of X- 
ray clusters. Excess energy is retained in the gas as thermal, grav- 
itational and/or kinetic energy as it passes through a merger tree. 
Even if the gas is ejected from a halo, it is expected to recoUapse 
into a larger halo at a later time, thus the excess energy is not lost. 
As a first approximation, the excess energy in a gas halo is given 
by the total energy obtained from non-gravitational heating, minus 
the energy lost via radiative cooling. By non-gravitational heating 
we refer to heating by sources such as supemovae and AGN. The 
total energy released by such sources (though not necessarily in- 
jected into or retained by the gas) comes to several keV per particle 
when averaged over all baryons in the universe. It therefore has the 
potential to strongly influence the properties of X-ray clusters and 
galaxies. 

It has been known for some time that the match between the- 
oretical predictions and the observed properties of X-ray clusters is 
significantl y improved i f we assume that the gas is 'pre-heated' in 



some way (Kaiser 1991). Hydrod ynamic simulations without non- 



gravitational heating or cooling (Navarro, Frenk, & White 1995 



Bryan & Norman 1998) obtain X-ray clusters that are approxi- 
mately 'self-similar', in the sense that small clusters (with tempera- 
tures 2 X 10^ K) are similar to large clusters (T ~ 10* K) scaled 
down in size. (Note that densities do not change in such a scaling.) 
However, the gas halos of observed clusters are not self-similar. For 
example, the X-ray luminosities of small clusters are an order of 
magnitude less than those predicted by scaling down the luminosi- 
ties of large clusters in this way. This suggests that the gas distribu- 
tions of small clusters are less concentrated than in large clusters. In 
order to break the self-similarity of X-ray clusters, excess energy is 
generally required. Excess energy affects small clusters much more 
than large ones. It can make the gas distribution more extended, or 
even remove some gas from the halo. Different models for heating 
clusters and breaking their self-similarity have been studied by a 



number of aut hors (jKaiser 199l|; [Evrard & Henry 1991 ; Metzler 
& Evrard 199 4; [Navarro, Frenk, & White 1995|; Cavaliere, Menci, 
& Tozzi 1997 ; |Wu, Fabian, & Nulsen 1998|; Ponman. Cannon, & 
; palogh, ' 



Navarro 1999; 



Babul, & Patton 1999; Loewenstein 1999 
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In order to model the effect of excess energy on gas halos, it 
is necessary to have a continuous range of gas profiles to choose 
from. The gas profile with density proportional to r^^ has been 
used successfully in many SAMs to model galaxies. However, it is 
too simple for modelling the properties of X-ray clusters. In par- 
ticular, the core of the gas density profile has to be flattened in 
order to obtain results that match the data (Wu, Fabian & Nulsen 
1998; WFN98). In WFN98 we introduced a family of isothermal 
gas profiles into our SAM. We assumed the gas to be in hydro- 
static equilibrium inside potential wells given by Navarro, Frenk 
& White (1997; NFW97) density profiles. This family of gas pro- 
files enabled us to increase the temperature of a gas halo uniformly, 
according to the excess energy in the gas. The main results from 
that paper are that we were able to fit the observed properties of 
X-ray clusters, including their gas fractions, metallicities. X-ray 
luminosity-temperature relation, temperature function. X-ray lumi- 
nosity function and mass-deposition-rate function, by including ex- 
cess energies of ~ 1 keV/particle. 

However, for a given total energy possessed by the gas halo 
(the sum of its thermal and potential energies), the isothermal pro- 
file represents only one solution out of a continuous range of pos- 
sible solutions. Furthermore, it is uncertain how heating modifies a 
gas halo, since that depends on details of how the heating occurred. 
We therefore need to test the sensitivity of results to the way that 
we modify the gas halo when excess energy is present. To do this, 
we extend the family of isothermal profiles by requiring that gas 
halos obey polytropic equations of state: P oc p"^, where P is pres- 
sure and pg is gas density. Thus for a given potential well and to- 
tal gas mass, the gas profile has two degrees of freedom, given by 
the parameter 7, which effectively specifies the shape of the tem- 
perature profile, and the normalization of the temperature profile. 
The isothermal profiles are retrieved when 7=1, while progres- 
sively steeper temperature gradients are obtained by increasing 7. 
We thus have the choice of increasing the temperature of a gas halo 
uniformly with radius or preferentially towards the centre, depend- 
ing on the 'heating model' that is used. One of the main purposes of 
this paper is to constrain the level of excess energy that intra-cluster 
gas must have in order to match the observed properties of X-ray 
clusters. We then critically discuss possible ways of obtaining this 
level of heating. 

The SAM used in this paper is based on that described by 
Nulsen & Fabian (1997, 1995; NF97 and NF95). A discussion of 
the main areas of difference with other SAMs is given in NF97. 
However, our study of X-ray clusters is not affected by such differ- 
ences, as their X-ray properties depend almost entirely on their gas 
profiles only. 

We use an open cosmology with Q.m = 0.3 and no cosmo- 
logical constant. A Hubble parameter of Ho = 50 km s^^ Mpc^^ 
is assumed throughout. We assume that density fluctuations are de- 
scribed by a CDM power spectrum with a primordial spectral in- 
dex of n = 1 and normalized to give — 0.75. In addition, 
we assume a baryon density parameter of Qt — 0.02h~^ (where 
Ho = lOO/i km s^^ Mpc^^) based on bi g-bang nucleosynthe- 



sis and deuterium a bundance measurements (Buries & Tytler 1998 
Buries et al. 1999 ). For h = 0.5, this implies 
initial gas fraction of ^b/^m ~ 0.27. 
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1.1 Plan of the paper 

The main resuhs from our model are discussed in sections || and|^ 
Section ^ investigates the excess energies required in X-ray 
clusters and the relevant parts of the model are described in sec- 
tions |1[ | and ^ 

Section ^ discusses the amount of excess energy obtainable 
from supernova heating in our model and therefore requires knowl- 
edge of our star formation model as described in the rest of sec- 
tion| 

In section y\ we discuss some effects not accounted for by our 
model that may possibly contribute to the excess energy. In the pro- 
cess, we give a more formal definition of excess energy and discuss 
the theory behind the concept in some detail. 

Finally, in section |^ we discuss four possible scenarios for 
breaking the self- similarity of clusters, aiming to be as model- 
independent as possible. We consider three sources of energy: su- 
pernovae, AGN and preferential removal of gas by cooling. We also 
discuss the role of entropy in this problem (section 8.2) and empha- 



size that both energy and entropy are important in determining the 
final gas distribution. In section M we summarize our conclusions. 



2 BRIEF DESCRIPTION OF THE MODEL 

We begin with a general description of our model which can be 
applied to any reasonable gas and DM halo profiles. More detailed 
discussions of the gas processes and galaxy formation model can 
be found in NF95 and NF97, which assumed essentially the same 
physics as used here. In Appendix A we apply the rules given in 
this section to the set of density profiles that we shall adopt. 



2.1 Merger trees 

Merger trees of virialized halos are simulated using the Cole & 
Kaiser (1988) block model. In a 'complete' simulation, we use 
20 levels of collapse hierarchy where the smallest regions are 
1.5 X 1O"M0 in mass. In the block model, masses increase by 
factors of 2 between levels, so that the mass of the largest block is 
7.9x IO^^Mq. This allows us to simulate the full range of structures 
from dwarf galaxies to the largest present-day clusters. However, 
if we are considering X-ray cluster properties only, it is ~ 1000 
times faster to simulate only the top 10 levels of the collapse tree. 
The mass of the smallest regions is then 2^" x 1.5 x 10^" = 
1.5 x 1O^^M0. In such low-resolution simulations some additional 
assumptions need to be made, such as the value of the gas fraction 
left over from the formation of galaxies. 

Since every collapse of a block (which corresponds to a major 
infall or merger) at least doubles the mass of the largest progenitor 
halo, a new halo is said to virialize with each collapse. The virial 
radius, r2oo, is defined such that the mean density within it is 200 
times the background density of an Einstein-de Sitter universe of 
the same age. The total mass of the halo inside r2oo is equal to 
the mass of the collapsed block. Likewise, the gas mass inside r2oo 
is the contribution from the entire block (unless the excess energy 
is so high that the gas halo is unbound). The new halo is given 
gas and DM density profiles, which allow the estimation of basic 
quantities such as the cooling time of the gas. From this starting 
point, the model proceeds to estimate the rate of star formation, 
supernova feedback, metal enrichment and other quantities that can 
be compared with observations. At the next merger, the properties 



of the progenitor halos (e.g. the mass of gas remaining) are then 
incorporated into the new halo. 

A collapse which is followed too closely by a larger-scale col- 
lapse does not have time to form a virialized halo. It is therefore 
not counted as a separate collapse. We allow a minimum time in- 
terval between collapses which is parametrized as a multiple of the 
dynamical time. Our results are not sensitive to this parameter and 
it is given a value of I . 



2.2 Cold and hot collapses 

For a gas halo to be considered hydrostatic, the gas at any radius 
has to remain still for at least the time it takes for sound to travel to 
the centre, which can itself be approximated by the free-fall time. 
As discussed in NF95, if the ratio of cooling time to free-fall time 
to the centre, r = tcooi/iff, is less than ~ 1, then the gas cools 
fast enough that it is not hydrostatically supported. It fragments 
and collects into cold clouds which we assume to form stars with a 
standard or slightly modified initial mass function (IMF). We refer 
to this as a cold collapse and the gas that takes part in it as cold gas. 

When r > 1, a hydrostatic atmosphere of hot gas (at roughly 
the virial temperature) is able to form. In this case, a cooling flow 
occurs if some gas has time to cool before the next collapse. Cool- 
ing gas flows inward subsonically and remains hydrostatically sup- 
ported. In clusters of galaxies, cooling flows are common and ob- 
servations show that the gas that cools does not form stars with a 
standard IMF b ut must remai n as very small, cold clouds or form 
low-mass stars (Fabian 1994). We refer to the product simply as 



baryonic dark matter (BDM). A possible mechanism for the for- 
mation of low-ma ss stars in cooling flows is described by M athews 
& Brighenti (1999), for the case of elliptical galaxies. 

To estimate the masses of hot and cold gas produced in a col- 
lapse, we use the gas and total density profiles to estimate tcooi and 
as functions of radius. To simplify computation, is estimated 
using the free-fall time of a test particle in a uniform background 
density, i.e. tff = -y/STr/lGGp, where G is the gravitational con- 
stant and the total density at the radius concerned is substituted for 
p. (This gives a slight overestimate of tg, as density actually in- 
creases towards the centre.) We thus obtain r(r), and compare it 
to a critical value, ro, to determine if gas is hot or cold. In well- 
behaved cases T increases monotonically with radius, so that there 
exists a unique radius r-cf, inside of which gas is labelled as cold, 
outside as hot. As halo mass increases, the trend is for to move 
from outside the virial radius to the centre. In other words, cold 
collapse gives way to hot collapse as we go to more massive ha- 
los. This transition is quite abrupt and takes place over about one 
decade in mass. 

From the above, it is clear that no single gas profile can al- 
ways describe the gas halo. Cooling modifies the gas distribution, 
and in a cold collapse the assumption of hydrostatic equilibrium 
breaks down completely. However, the gas profile used in the model 
is only notional — defined as that obtained in a notional collapse 
with cooling ignored (Nulsen, Barcons & Fabian 1998). Used in 
this way, it allows us to estimate the behaviour of different sub- 
sets of gas. In the case of hot halos, if the part that has cooled is 
small compared to the whole, then the density and temperature of 
gas away from the cooled region do not change significantly as the 
halo reestablishes hydrostatic equilibrium. The original gas profile 
therefore gives reasonable estimates of bulk properties. 
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2.2.1 The criterion when excess energy is large 

If the excess energy from heating is large enough to be comparable 
to the binding energy of the gas halo (as defined in section W), then 
r may not increase monotonically with radius (see Appendix A for 
examples). Such cases can account for a fair fraction of low-mass 
galaxies because of their smaller binding energies. This raises the 
question of whether gas with r < ro outside a core of gas where 
r > To still ends up cold after collapse. Since the value of r and its 
interpretation are approximate, we opt for a simple criterion in such 
cases, which determines whether all or none of the gas halo takes 
part in a cooling flow (Appendix A). We note that r(r) is a fairly 
flat function of radius if the strongly heated gas halo is isothermal. 

2.3 Star formation, supernova feedback, and cooling flows 

Star formation is presumed to proceed rapidly in cold gas and leads 
quickly to type II supemovae. This is assumed to continue until the 
energy from supemovae is sufficient to eject the remaining gas in 
the halo to infinity, or until the cold gas is used up. If the gas halo 
is not ejected, supernova energy can modify the gas density profile 
by increasing the total energy of the remaining gas (see section ^. 
The effect of this is generally small but is included for consistency. 
The remaining gas, which is hot, may then take part in a cooling 
flow, depositing BDM if it manages to cool by the next collapse or 
the present day. For halos which contain only hot gas or cold gas, 
the situation is naturally simpler than described. 

We only follow the production of type II supemovae (SNII) 
in our model. Precise knowledge of the IMF is not required, since 
we only need to know the number of SNII resulting from a certain 
amount of star formation. It is generally assumed that the progen- 
itors of SNII are stars of mass M > 8M0. For a standard IMF 
(more precisely the Miller-Scalo IMF), we adopt the estimate of 
one SNII for every 8OM0 of stars formed with M < IMq (Thomas 
& Fabian 1990). In the simulations, we make the simplification that 
stars with M > IMq are instantaneously recycled, so that only the 
total mass of stars with M < IMq is recorded. This allows us to 
calculate the mass of stars remaining in present-day clusters. Since 
the lifetime of a star is approximately W^^ {M / Mq)~'' years, the 
recorded stellar mass is a good approximation of this quantity. 

[The above suggests that the amount of gas in a halo could be 
overestimated by the model, since in reality, stars of intermediate 
mass (IMq < M < 8Mq) recycle their gas as planetary nebulae 
on intermediate time scales. However, we find that in newly-formed 
halos, the stellar mass is almost always < 1/10 of the gas mass, so 
that the effect of recycled gas on the latter is small (halos of a few 
IO^^Mq are an exception, as ~ 1/3 of them have more stars than 
this). Another minor problem occurs when only a small fraction of 
the gas in a halo is cold, so that most of the cold gas forms stars. 
In this case, the assumption of instantaneous recycling can cause 
the amount of star formation to be overestimated (in the extreme, 
all of the cold gas can be converted into stars with M < IMq ). 
Fortunately, the fraction of stars formed in such situations is very 
small, so that the error in the stellar mass of present-day clusters is 
less than I per cent.] 

In the simulations, we follow NF97 by boosting the above su- 
pernova rate by a factor of 5. Hence each SNII is associated with 
I6M0 of stars formed with AI < IMq. This corresponds to us- 
ing a flatter slope for the IMF. Since the bulk of star formation in 
our model occurs as massive bursts in dwarf galaxies, it should not 
be surprising to find that the IMF is modified under such circum- 
stances. 



To give an actual example, a power-law IMF with a slope of 
X = 0.9 (the Salpeter IMF has x — 1.35), and lower and upper 
cutoffs of O.IM0 and SOMq, gives 1 SNII for every 15Mq of stars 
with M < IMq. (Results are not very sensitive to the upper cutoff, 
because very massive stars are rare.) Using this IMF, we can esti- 
mate the error in our assumption that the stellar mass of a present- 
day cluster is given by the stars with M < IMq . Suppose the stars 
in the cluster have an age of 5 Gyr instead of 10 Gyr, then the sur- 
viving stars would be given by AI < 0.5^^'''^ = 1.26A4q. For the 
above IMF, the stellar mass in the range O.IM© < AI < I.26M0 
is 1 1 per cent greater than that in the range 0.1 Mq < AI < IMq. 
The stellar mass of model clusters are therefore correct to ~ 10 per 
cent. 

The energy per supernova av ailable for the ejection of gas is 
parametrized as 4 x 10^"esN erg (Spitzer 1975). Although the to- 
tal energy released by a supernova is typically ~ 10^^ erg, a large 
fraction of this is likely to be radiated, especially if the supernova 
explodes in cold gas. Each SNII is assumed to release an average 
of O.O7M0 of iron (Renzini et al. 1993). The solar iron abundance 
is taken to be 0.002 by mass (Allen 1976). Renzini et al. find that 
the average iron yield is fairly insensitive to the slope x of the IMF. 
We note that a more rec ent compilation of avera ge iron yields from 
a range of SNII models ( Nagataki & Sato 1998 ) shows a wider dis- 



persion, ranging from 0.07 to 0. 14M0 of iron per SNII. However, 
most of these SNII models assume that the progenitor stars have 
solar metallicity, whereas the bulk of star formation in our model 
occurs in low metallicity dwarf galaxies. If we only consider the 
low metallicity SNII models, then the range narrows to about 0.07- 
O.O9M0 of iron per SNII. 

When a new halo collapses, the mean iron abundance and 
mean excess specific energy (-Ecxccss) of the gas are calculated and 
assigned to the gas halo. The excess energy of a gas halo, as it 
name implies, is the increase in its total energy (defined below) 
relative to the total energy it would have in the absence of any non- 
gravitational processes. In the model it is approximated by the total 
energy injected by supemovae, minus the energy radiated in pro- 
genitor halos, over the history of the gas. If some gas is removed 
from the gas halo by a cooling flow, iScxccss is assumed to stay the 
same for the remaining hot gas. The reduction in iSoxccss by radia- 
tive cooling is easily accounted for, since the cooled gas is either 
converted to stars/BDM, or is ejected from the halo by supemovae. 
Since we assume that the gas is always ejected at the escape ve- 
locity of the halo, the resulting value of iJoxcoss is simply given 
by the binding energy of the gas halo (as defined below). Other 
mechanisms that may affect iScxccss but are not accounted for by 
the model are discussed in section ^. In particular, if the gas is dis- 
placed by strong heating, this can lead to an extra 'gravitational 
contribution' to the excess energy, that is usually positive. Unfortu- 
nately, this contribution is in general difficult to compute (without 
hydrodynamic simulations) and is likely to be model-dependent as 
well. (The approximation to iScxcoss made by the model may be 
compared to the approximation made when inferring the cluster- 
ing of mass from the clustering of galaxies, which is traditionally 
handled by a 'bias parameter'.) 



3 THE DISTRIBUTION OF TOTAL DENSITY IN HALOS 

We begin by specifying the total density profile of a halo, which 
allows us to derive the shape of the potential well. This is then used 
in the following section to derive gas density profiles. 

From a series of N-body simulations in different cosmologies 
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Figure 1. Scatter plot of concentration c vs. halo mass. Halo masses take 
discrete values in the block model. Each mass bin contains a maximum of 
100 points. Halos were selected randomly from the simulation regardless 
of redshift. The solid line gives the mean value of each mass bin, and the 
dotted lines are plotted one standard deviation from the mean. 



and with both CDM and power-law fluctuation spectra, Navarro, 
Frenk & White (1997; NFW) found that the density profiles of vki- 
alized halos obey a universal form, given by 

Scpcrit 



p(r) 



(1) 



(r/r«)(l+r-/r,)2' 

where pcrit = /8ttG and H is the Hubble parameter at the 
time of collapse. The characteristic density Sc is calculated ac- 
cording to a prescription described in the Appendix of NFW. This 
method amounts to setting the scale density, ps ~ Scpait, equal to 
3000 times the background density when the halo was 'assembled', 
subject to an appropriate definition of this assembly time. The as- 
sembly time is a function of halo mass and redshift of virialization 
only (given the cosmology and fluctuation spectrum). 

From the value of Sc and the mean density of the halo within 
'"200, the scale radius is uniquely determined. Thus Sc is the only 
'degree of freedom' in the profile. For convenience, x = r jr^ is 
often used to denote radius. The value of x at the virial radius, c = 
'"20o/'"s, is an important parameter known as the concentration. 

[On a technical point, our model actually differs slightly from 
the original NFW prescription. This is because NFW defined the 
mean density of a halo to be 200pcrit, whereas we have chosen to 
follow the spherical collapse model more closely when calculating 
the mean density. By following their prescription for calculating 
5c, we have preserved their explanation for its origin. However, 
quantities such as and c will differ slightly.] 

We make the further approximation that the NFW profile de- 
scribes the total density in a halo (i.e. including the gas density) and 
that it is truncated to zero for r > r2oo- This allows us to derive the 
gravitational potential as a function of x: 



ln(l + x) 
\x)^a\ i + 



1 + c 



(2) 



where a — AnGpsr^- 

To illustrate typical values of c obtained in this model. Fig- 
ure [l] shows a scatter plot of c against halo mass for our choice of 
cosmology and fluctuation spectrum. For halos that collapse at a 
given redshift, c increases substantially with decreasing mass, e.g. 
the steep upper edge of the distribution is given by halos that viri- 



(3) 



alize at z = 0. However, for a given mass bin, c decreases with 
increasing redshift. As a result, the mean value of c does not vary 
much with halo mass, because less massive halos are more likely 
to collapse at higher redshift. 



4 THE DISTRIBUTION OF GAS IN HALOS 

Given the NFW potential well (^ and the total gas mass within 
r2oo , we make two further assumptions in order to calculate the gas 
density profile. The first is that the gas is in hydrostatic equilibrium, 
i.e. 

dP _ _ d0 
dr dr ' 

and the second is that P and pg are related by some equation of 
state. For example, if we assume a perfect gas law and isothermal- 
ity, then P (x pg and the only parameter is the temperature, T. 
Once T is specified, the gas profile is uniquely determined. Below, 
we first describe the general procedure that we use to determine 
such parameters. 

We refer to the gas profile obtained in the absence of excess 
energy as the default profile. Since the NFW profile is not self- 
similar (see Fig. hi), it is not possible to define a self-similar default 
profile for gas halos. In the absence of heating, it is common to as- 
sume that (T) is proportional to (cr^) for the DM halo, where a is 
the velocity dispersion and the brackets denote some form of aver- 
age. However, (ct^) is non-trivial to compute for the DM halo, and 
we are more interested with the total energy of the gas halo than 
just its thermal energy, since any excess energy would be added to 
the former. In order to retain some level of self-similarity, we there- 
fore postulate that the total specific energy of the gas halo, iSgas, is 
proportional to the specific gravitational energy of the whole halo 
(which is modelled by the NFW profile): 



where iSgas is defined by 



Eg 



Ps 



3kT 



+ <^ dV. 



(4) 



(5) 



The above integrals are performed out to the radius r2oo, Aftot and 
i\fgas are the total mass and total gas mass respectively, and the 
total density ptot is given by NFW density profile. The Boltzmann 
constant is denoted by k and prriu is the mean mass per particle 
of the gas. Note that Eg^s < in order for the gas halo to remain 
gravitationally bound. 

The constant of proportionality A" is a parameter of the 
model. It is calibrated by requiring that the default profiles of the 
largest clusters approximate well those from X-ray observations. 
We match to the largest observed clusters because if heating does 
occur, we expect it to have least effect on them. Once Eg^s is com- 
puted from (M), the gas profile is uniquely determined if it is se- 
lected from a family with only one parameter (e.g. the isothermal 
family). In general, a numerical procedure is required to search for 
the gas profile with the matching value of iSgas- The X-ray clus- 
ters obtained in this way do closely follow the self-similar scaling 
relations for Mtot, T and Lx. 

We refer to the value of | Eg^s \ given by (^ as the binding en- 
ergy. As its name implies, the binding energy is the excess specific 
energy required to unbind the gas halo. 



When the excess specific energy, Ec 
increased accordingly: 



, is non-zero, Ega,s is 
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K 



-pt^^4>AV + E 



excess • 



(6) 



This allows the heated gas profile to be found. In general, as -Eoxccss 
increases, the gas temperature increases and the gas distribution be- 
comes more extended (i.e. the density profile becomes flatter). Thus 
the excess energy goes into increasing both the thermal energy and 
the potential energy of the gas halo. 

If an isothermal family of gas profiles is used, heating in- 
creases the temperature uniformly with radius. Frequently, prop- 
erties such as the luminosity of an X-ray cluster or the amount of 
gas able to cool in a given time are sensitive only to the gas density 
near the centre. Therefore, if we increase the temperature prefer- 
entially towards the centre, then we can obtain the same changes 
in these properties for less excess energy. A convenient way of 
modelling non-isothermal profiles is to use a polytropic equation 
of state: P oc Pg. There are then two degrees of freedom, rep- 
resented by 7 and the constant of proportionality in the polytropic 
equation. Since there are two parameters, a continuous range of gas 
profiles now have the same value of -Egas. Thus a further constraint 
is required to determine the gas profile uniquely. 

A heating model is obtained by specifying 

a) the constraint used to determine the default profile, and 

b) the path in parameter space followed by the gas profile as 
Scxccss increases. 

In order to obtain a good match to the largest clusters, the param- 
eter K is allowed to depend on a). Thus, K may also be regarded 
as part of the heating model. The specification of a heating model 
is of course artificial; in reality, the gas profile is determined by ad- 
ditional factors such as the gas entropy distribution and how shock 
heating occurs. In lieu of a more complex model, we shall use a 
few contrasting heating models to test the sensitivity our results. 

4.1 A 2-paraineter family of gas density profiles 

We now derive the family of gas profiles used in our model, assum- 
ing a polytropic equation of state and a perfect gas law. If we first 
let 7 = 1, then T is constant and equation ^ gives 



(7) 



I for the NFW potential yields 



Pg(r) oc exp \^ —<f> 

Inserting the expression ( 

Pg(r)(x(l-fx)''/^ (8) 

where r) = fimiia/{kT) is a dimensionless parameter that charac- 
terizes the slope of the density profile. Recall that a is the character- 
istic gravitational potential of the NFW profile. The mean value of 
ri obtained by fitting this model to highly l uminous X-ray clusters 
is approximately 10 ( Ettori & Fabian 1999| ). 

For 7 7^ 1, we use P oc to eliminate P in equation^ and 
then use Pg~^ oc T to get 



_d_ 

dr 



kT 



7 — 1 d</> 



prriH J 7 dr 

Substituting for the potential gives 



T 

7200 



1 + 



- 1 / ln(l + x) ln(l + c) 

V200 

7 \ a:: c 



(9) 



(10) 



where 77200 ~ t-imnce / {kT2oo) is the value of rj at the virial ra- 
dius (where x = c). Thus, using 7 > 1 causes the temper- 
ature to increase monotonically towards the centre. Substituting 

Pg oc T^^/^-^', we get 



10.000 








: 


1 .000 




' " ^ ' ■ ■ ^ 






0.100 










0.010 








\ : 


0.001 











Figure 2. Gas density profiles, with parameters representative of those ob- 
tained in heating models A and B (see Fig. ^ and text). The same total 
gas mass and NFW potential well were used throughout (we set c = 5). 
The solid curve (the default profile for the purposes of this figure) uses 
77200 = 10 and 7 = 1. The series of dotted profiles have 7 = 1 but decreas- 
ing values of 77200: namely 77200 = 8.5, 7.1, 6.0 and 5.1. These values were 
chosen so that their total specific energies, Egas , increase at regular inter- 
vals. The flattest profile, 77200 = 5.1, has zero total energy and is marginally 
bound. The series of dashed curves have the same total energies as the 
dotted curves, but have the following pai'ameters: (r;200i7) = (lOi 1-1). 
(10, 1.3), (8.7, 5/3) and (6.8, 5/3). Notice that for the same increase in 
Egas , increasing 7 has a greater effect on densities at small radii than re- 
ducing 7^200- Finally, the dot-dashed curve uses 7 = 1.2 and 7)200 = 28, 
and is representative of default profiles obtained in Models C and D. 



Pg,200 



7-1 
1 H 77200 



ln(l + x) ln(l + c) 



,(11) 



where Pg,200 is the gas density at the virial radius. It is straight- 
forward to show that this approaches the isothermal form (H) as 
7^1. We henceforth use the parameters 7 and 77200 to specify the 
gas profile. 

It is also useful to coinpute the 'entropy', s = T jr^J'^ , where 
7ie is the electron density, and rie oc pg. For our purposes, s may 
simply be regarded as a label for the adiabat that the gas is on. For 
the gas to be stable to convection, the entropy must increase with 
radius. When 7 = 5/3, the entropy is constant with radius, thus the 
atmosphere is marginally stable to convection. Atmospheres with 
higher values of 7 and steeper temperature gradients convect to re- 
duce the temperature gradient. Hence 5/3 is the maximum value of 
7 used in the model. The minimum value used is 7 = 1. We do 
not use lower values of 7 as there is little evidence for the tem- 
perature in halos to increase with radius, both from X-ray cluster 
observations and hydrodynamic simulations. 

In Figs. 1^ |3| and ^ we display the density, temperature and en- 
tropy profiles of a selection of gas halos covering a range of 77200 
and 7 values. All other parameters, in particular the total gas mass 
and NFW potential well, have been kept constant. In each figure, 
the series of dotted curves and dashed curves represent two differ- 
ent ways of heating the gas halo represented by the solid curve. In 
each series, the value of -Egas was required to increase at regular 
intervals from that of the solid curve up to a value of zero. Hence 
the gas halo with the most energy in each series is only marginally 
bound. By comparing the two series it is evident that profiles with 
the same total energy can differ significantly. 
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Figure 3. As Fig. g, but showing temperature profiles (note that the temper- 
ature scale is linear). Temperatures have been normalized so that the soHd 
curve has a temperature of unity. Increasing 7 leads to steeper temperature 
gradients without changing the temperature at the virial radius {x = c). In 
contrast, reducing r;200 increases the temperature uniformly. 




on how the collapse occurred in detail. For example, how the gas 
collapsed relative to the dark matter affects how much energy was 
transferred between the two components. [However, we do assume 
that such processes result in the scaling law expressed in (^).] 

To calibrate K for the case of 7 = 1, we use the results of Et- 
tori & Fabian (1998), who fitted the surface brightness profiles of 
36 X-ray clusters with Lx ^ 10*^ erg s~^. When fitting to avoid 
any cooling flow region, they obtain a mean value of 77 = 10.29 
with an rms scatter of 1.55. (Since the temperature is constant, rj 
and 7^200 are the same.) In order to match this we set K = 1.2, 
which gives a mean value of 7; = 10.5 in the corresponding model 
clusters. However, the scatter of 77 in our model is only ~ 0.5. If 
we now set the g as fraction of clu sters equal to 0.17 (th e mean 
value obtained by Evrard 199"/| and Ettori & Fabian 1999 , assum- 
ing h = 0.5), we find that the model clusters naturally follow the 
observe d Lx — T relation for cl usters more luminous than 2 x 10'*^ 
erg s^^ (Allen & Fabian 1998a). (We refer to bolometric luminosi- 



ties throughout.) Note that this fit is possible because the largest ob- 
served clusters roughly follow the self-similar relation Lx oc T^, 
instead of the steeper relation obeyed by smaller clusters. 

Turning to the case of 7 = 1.2, we note that compared to the 
isothermal profiles these are almost always poorer fits to the surface 
brightness profiles of real clusters ( ^ttori & Fabian 1999| ). Hence 
for this case we calibrate K by simply m atching the Lx— T relation 
measured by Allen & Fabian (1998a I. As above, we set the gas 
fraction of all clusters equal to 0.17. We find that if = 1.5 results in 
an Lx — T distribution that best fits the data. The resulting clusters 
have 77200 ~ 28. An example of such a profile is shown in Figs. ^ to 
^as dot-dashed curves, for comparison with the solid curves (7=1 
and 77200 = 10). Notice that although the two density profiles have 
different shapes, they roughly follow each other and intersect at 
two points. [The higher value of 77200 = 28 merely implies that 
the temperature at r2oo is lower by a factor of 2.8 compared to the 
77200 = 10 case.] 

Since 7 is fixed for both types of default profile, it is not hard 
to show that 77200 is a function of the NFW concentration c only. 
We find that it is only a weakly increasing function of c in both 
cases. Since the model clusters have a relatively small scatter in c 
and 77200, they are close to self-similar when heating is absent. 



Figure 4. As Fig. bl but showing 'entropy' profiles, given by the expres- 

2/3 

sion T / Pg' . Most of the profiles are quite steep, but increasing 7 (dashed 
curves) results in much flatter entropy profiles. Isentropic profiles are ob- 
tained when 7 = 5/3. 

4.2 Profile selection: the heating models 

4.2.1 The default profile 

The first step is to determine the default profile. In a 2-parameter 
family of gas profiles, a profile can be specified by the value of 
i?gas and one further constraint. We shall consider two different 
constraints for selecting default profiles: 7 = 1 or 7 = 1.2, de- 
pending on the heating model. The former yields isothermal gas 
halos in the absence of heating, and is motivated by its simplicity. 
The latter is motivated by the temperature profiles of X-ray clus- 
ters measured by Markevitch et al. (1998), who approximated their 
results with a polytropic index of 1.2-1.3. For each constraint, we 
need to calibrate the parameter K used in equation ^ 

We calibrate K by matching the model clusters obtained with 
^excess = to the largest observed clusters. We do not attempt 
to estimate K theoretically, as it is our opinion that iSgas depends 



4.2.2 The heated profile 

When excess energy is present, the default profile is modified to 
give the heated profile. We model this in two ways: by decreasing 
77200 while keeping 7 constant, or by increasing 7 while keeping 
77200 constant. The former has the effect of increasing the temper- 
ature at all radii by the same amount (to see this, multiply equa- 
tion|l^by T200 and note that 77200 T200 remains constant). The latter 
steepens the temperature gradient while ensuring that the tempera- 
ture at r2oo stays constant, so that heating is concentrated towards 
the centre. 

Since there are two types of default profile, we have four heat- 
ing models in total. These are summarized in Fig. ^ Models A and 
B have default profiles with 7=1 and Models C and D have de- 
fault profiles with 7 = 1.2. Heating increases 7 in Models A and 
C, but reduces 77200 in Models B and D. 

There are a few loose ends to tie up. If the excess energy is 
so high that L^gas > 0, then the gas is not bound and it does not 
form a halo. However, for Models A and C, the gas halo may still 
be bound when 7 has increased to 5/3. Therefore, to increase iSgas 
further we reduce 77200 instead, as shown in Fig. ra. 
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Figure 5. A schematic diagram of how gas profiles are selected in each 
heating model. The first step is to find the default profile: depending on the 
heating model it has either 7 = lor7 = 1.2; »?200 is then determined 
by requiring that the total specific energy, -C/gas ? 

satisfies (^. (Note that the 
parameter K = 1.2 for Models A and B, and K = 1.5 for Models C and 
D.) The filled circles give only the approximate positions of default profiles, 
since r)200 depends on the NFW concentration c. The heated profile is found 
in the second step: any excess specific energy increases iJgas accordingly, 
and may increase the temperature uniformly (Models B and D), or increase 
the temperature preferentially towards the centre (Models A and C; these 
are modified to accommodate the upper limit of 7 = 5/3 when heating is 
very strong). 



5 THE EXCESS ENERGIES REQUIRED IN X-RAY 
CLUSTERS 

In this section, we present the cluster results obtained with each 
of the four heating models. Since we are concerned solely with 
clusters here, the parameter tq and the star formation model play 
almost no part in the results. (No cold collapse occurs in the model 
clusters for all reasonable values of tq.) 

The simulations are 'low resolution' in the sense that they only 
use the top 10 levels of the collapse tree (section 2.1). Hence, the 
smallest regions have masses of 1.5 x W^^Mq. Each simulation 
used a total of 10000 realisations of the merger tree. W e set the 
gas fraction of every cluster equal to 0.17 (^vrard 1997^ E ttori & 
Fabian 1999) for definiteness. The formulae used to calculate bolo- 
metric luminosity Lx, emission- weighted temperature T and the 
instantaneous mass deposition rate, Al, are given in Appendix A. 
All quantities were evaluated at z = 0. 

One simulation was performed for each heating model, and in 
each case all of the clusters were given a constant excess specific 
energy. For each heating model, we found the excess specific en- 
ergy that best fit the data by matching to the Lx — T relation of 
David et al. (1993) in the first instance. 

The best-fitting excess energy for each heating model is given 



in Table ll. The resulting Lx — T distributions are displayed in 
Figs ^ to ^ The slopes of the distributions given by Models B and 
D are slightly steeper than the observed slope. This suggests that we 
need to relax our assumption of a constant LJcxccss for all clusters. It 
is also evident that the Lx — T distributions flatten slightly at high 
temperatures, ten ding to Lx oc T'^, in ag reement with the largest 



observed clusters (Allen & Fabian 1998a) 



Recall that we calibrated the largest clusters to match the 
Lx — r relation of Allen & Fabian when heating is absent. The 



Table 1. Best fitting values of excess energy for each heating model, ob- 
tained by matching to the Lx — T relation measured by David et al. (1993). 
Excess energy per particle is calculated as {/J-rrin ABgas). 



Heating Model 


Excess Energy (keV/particle) 


A 


1.8 


B 


2.8 


C 


2.2 


D 


3.0 



X— ray luminosity vs temperature 




2xlo' 



Figure 6. Contour plot of the cluster X-ray luminosity-temperature distri- 
bution obtained from Model A, with heating included at the level given in 
Table |l|. The contours are spaced at equal logarithmic intervals. The long 
straight line is the best fit (for bolometric luminosities) taken from David et 
al. (1993). The extent of the line corresponds roughly to the extent of the 
data. 
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Figure 7. Same as Fig. ^but using Model B. 
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X-ray luminosity vs temperature 



Cluster X— ray luminosity function: z ■ 




Figure 8. Same as Fig. p but using Model C. 



X— ray luminosity vs temperature 
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Figure 9. Same as Fig. ^ but using Model D. 



results thus confirm that the largest clusters are least affected by 
the excess energy (see also Fig. 1 in WFN98). However, the hottest 
clusters shown are in fact about a factor of 1/3 less luminous than 
before heating. We do not attempt to correct for this relatively small 
discrepancy. It is possible that in reality ^excess would be more di- 
luted, i.e. smaller than we have assumed, in the largest clusters. 

As expected. Models A and C require less heating than the 
other models, because they concentrate heating towards the centre 
of clusters, where most of the luminosity comes from. In addition. 
Models C and D require slightly more excess energy than Models 
A and B, respectively. Nevertheless, the highest excess energy in 
Table ^ is only about 50 per cent more than the lowest, over a set 
of very different heating models. 

We display the X-ray luminosity function, temperature func- 
tion and mass deposition rate (A/) function from the same simula- 
tions in Figs.jl^, [iTjand |l^, respectively. In each plot we have used 
a different line for each heating model. Superimposed on each plot 
is the observed data, as described in the captions. The same remarks 
regarding the simulated and observed M functions made in the pre- 
vious section apply here. (However, the exclusion of clusters cooler 



0.1 1 
(10*'' erg s"') 

Figure 10. The X-ray luminosity functions given by all four heating models. 
The model results are plotted as follows: Model A: solid line, Model B: 
dashed line, Model C: dot-dashed line. Model D: dotted line. The curve is 
the best-fitting Schechter function for the i?05/ir Brightest Cluster Sample 
bolometric luminosity function (Ebeling et al. 1997). 



Cluster temperature function; z 




Figure 11. The X-ray temperature functions given by all four heating mod- 
els, plotted with the same line styles as in Fig. ^ The straight line is the 
power law fit obtained by Edge et al. (1990). 



than 2 x lO^K has practically no effect on the M functions sim- 
ulated here, for most gas halos below this temperature have been 
unbound.) 

The luminosity and temperature functions obtained with all 
four heating models give good fits to the data. However, the model 
M functions give relatively poor fits. 

Models C and D give particularly poor fits where M > 
lOOM© y"'^. This is because the mass deposition rate of large clus- 
ters are too high in these models. This can be attributed to the flatter 
cores of their gas density profiles. The poor performance of Models 
C and D support the result that the 7 = 1.2 gas profiles are rela- 
tively poor fits to the surface b rightness profiles of l arge clusters 
compared to the "f — 1 profiles (Ettori & Fabian 1999). 

Models A and B show a deficit of clusters with small cool- 
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Cluster cooling rate cumulative function: z - 0.0 
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dM/dt (Mg y ') 

Figure 12. The mass deposition rate functions (plotted cumulatively) for 
all four heating models, plotted with the same line styles as in Fig. iq The 
jagged line is the same function taken from Peres et al. (1997), modified by 
using a cluster age of 6 Gyr 



Table 2. Table of excess energies used with Model B to improve the mass 
deposition rate function, which is shown in Fig. h4l 



Halo Mass (lO^^M©) 


Excess Energy (keV/particle) 


> 246 


2.8 


123 


2.3 


61 


1.9 


35 


1.5 


15 


1.1 



ing flows (A/ — IO-IOOM0 y~^). The main reason for the deficit 
is because the excess energies are now too high for the smallest 
clusters. We have repeated the simulation for Model B using lower 
excess energies in clusters less massive than 246 x 10^^ M©. The 
excess energies are given in Table ^ they increase steadily with 
mass up to 246 x 10^^ Mq. The resulting Lx — T distribution and 
M function are shown in Figs. and respectively. Both show 
a better match to the data than before. The new M function has an 
increased number of small cooling flows, and the new Lx — T dis- 
tribution reaches to lower temperatures (due to the reappearance of 
~ 2 keV clusters, which were previously unbound). 

If it is true that iSoxccss increases with cluster mass, then this 
may be hard to reconcile with heating by supemovae, because 
we then expect iScxcGss to become more diluted with increasing 
halo mass (see section In this case, a significant amount of en- 
ergy injection would have to occur in clusters themselves (possibly 
by AGN). However, we note that this result is somewhat model- 
dependent, for it is possible to avoid it by combining different heat- 
ing models. If large clusters are heated preferentially towards the 
centre (as in Model A) but small clusters are heated more uniformly 
(as in Model B), then it is possible that an excess energy of roughly 
1.8 keV/particle across all clusters could satisfy all the data (see Ta- 
ble [l] and ||). Such a scenario may result from a characteristic scale 
in the spatial distribution of the heat source (supernovae or AGN). 
Alternatively, a strong wind may distribute its energy more effi- 



X— ray luminosity vs temperature 



o 




2xl0' 5x10' 10" axio' 

T (K) 



Figure 13. As Fig. |j, but using excess energies which increase with halo 
mass, as given in Table ^ Model B was used. Previously unbound groups 
now appear at temperatures below 2 keV. 



Cluster cooling rate cumulative function: z - 0.0 
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Figure 14. As Fig. |l2| but for Model B only, using increasing excess ener- 
gies with halo mass as given in Table ^ The number of small coohng flows 
has increased, improving the fit to the data. 

ciently through a small (proto-)cluster, because the cluster is closer 
to being unbound, i.e. it is more disturbed. 

5.1 Using all available gas profiles 

By using all the available gas profiles in the 2-parameter family 
(i.e. independently of any heating model), we have also found the 
minimum excess energy required to put a fiducial cluster on the 
observed Lx — T relation. 

We considered the specific case of a halo of mass 1.23 x 10^ 
Mq, which virializes at z = with a gas fraction of 0.17. Such a 
cluster has a temperature of around 2 keV, depending on the amount 
of heating. To obtain the NFW profile we assumed the same cos- 
mology as before. The problem was structured as follows. We first 
found the locus of points in (77200, 7)-space which put the cluster 
on the observed Lx — T relation. From these points we then found 
the one which had the least excess energy. However, the gas profile 
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Figure 15. Contour plot in parameter space for a fiducial cluster of mass 
1.23 X 10^'' Mq, collapsing at 2 = with a gas fraction of 0.17. The 
dashed curve gives the parameters of gas halos that lie on the best-fitting 
Lx — T relation obtained by David et al. (1993), and gas halos in the 
shaded region lie within the region of uncertainty of this relation. The thin 
contours are labelled by excess energy (keV/particle), measured relative to 
an isothermal default profile (as in Models A and B). The profile that re- 
quires the least excess energy to match the Lx ~ T relation is given by 
7 = 5/3 and 77200 = 26. It has an excess energy of 0.95 keV/particle. The 
thick solid line roughly sweeps out the positions of other possible default 
profiles (see text). 



specified by (77200, 7) only tells us the value of iSgas — to compute 
-Bcxccss we also need the 'default' value of iSgas, i.e. when heating 
is absent. In what follows, we assume that the default value of iSgas 
is given by equation ^ with K = 1.2 (as in Models A and B). 

Fig. ha shows contours of excess energy in parameter space, 
labelled in keV/particle. The gas halo becomes unbound for ex- 
cess energies above 3.1 keV/particle. The dashed curve gives the 
parameters of gas halos that lie o n the best-fitting p ower law to 
the observed Lx — T distribution (David et al. 1993). The shaded 



area contains gas halos that lie within the Icr region of uncertainty 
for this best-fit. Note that it represents the uncertainty in the mean 
properties of X-ray clusters, and should not be confused with the 
dispersion in the Lx — T relation. From the plot, the gas profile 
with 7 = 5/3, 77200 = 26 requires the least excess energy to match 
the best-fit relation. It has an excess energy of 0.95 keV/particle. 
If the shaded region is taken into account, the minimum excess en- 
ergy is roughly 0.7 keV/particle. It should not be surprising that the 
above profile is marginally stable to convection. We 'save energy' 
by concentrating the heating where it makes the most difference, 
i.e. near the centre, but convection limits the extent to which we can 
do this. The gas halo that requires the least heating is therefore the 
one with the isentropic atmosphere. This suggests that the 7 = 5/3 
profile probably requires the least heating among all possible gas 
profiles. 

A similar plot displayed in Fig. |^ shows contours of entropy 
(given by s = T IrSJ'^) at a radius of 0.1r2oo- The entropy varies 



Figure 16. Similar to Fig. [l^, showing also contours of entropy at a radius 
of 0.1r2oo (dotted lines). The contour labels give s = T lr?J''^ in units of 
keV cm^ . 



significantly along the dashed line, from around 200 keV cm'^ to 
600 keV cm^ . The plot shows that the energy requirements are re- 
duced if heating raises the entropy as much as possible. (We dis- 
cuss this further in sect ion 8.2 .) The model entropies may be com- 
pared to the results of Ponman, Cannon, & Navarro (1999), who 
measured the entropies of groups and clusters at this radius in or- 
der to avoid possible cooling flows. However, these authors used 
emission-weighted temperatures to compute the entropy, whereas 
we have used radial-resolved temperatures. When this is accounted 
for, the above range of entropies are all consistent with the data. 

So far we have assumed that in the absence of heating i?gas = 
—3.1 keV/particle, as given by equation ^ using K = 1.2. If we 
use K = 1.5 instead (as in Models C and D), then the default 
value of iSgas becomes 1.5/1.2 x (—3.1) ~ —3.9 keV/particle. 
As this is lower than before, all excess energies are increased by 
0.8 keV/particle. We can generalize further by considering what 
parameters the cluster would need in order to lie on the self-similar 



relation Lx oc T normalized to the largest observed clusters 



(Allen & Fabian 1998a). The gas profiles which satisfy this rela- 
tion are given by the thick solid line in Fig. |l5[ As expected, it 
passes close to the points (77200,7) = (10, 1) and (28,1.2), where 
the default profiles of our heating models are found. Thus the thick 
line roughly sweeps out the locations of possible default profiles. 
By assuming a 7 = 1 default profile in the above analysis, we ob- 
tained the highest default value of iSgas, and therefore the lowest 
possible excess energies. 



6 THE EFFECT OF SUPERNOVA HEATING 

In this section, we investigate the amount of excess energy obtain- 
able from supernova heating. Complete simulations with 20 levels 
of collapse hierarchy were performed with Models A and B. Each 
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Table 3. The values of egN and tq used with Models A and B. 







TO 


Model A 


0.3 


1.0 


Model B 


0.25 


0.4 


Model B 


0.15 


0.1 



simulation used 100 realisations of the merger tree. Below, we be- 
gin by setting the parameters of the galaxy formation model. 



6.1 Setting the model parameters 

There are three parameters that remain to be set. They are the crit- 
ical ratio of cooling time to free-fall time, ro, the efficiency of su- 
pernova feedback, esN, and the boost in the rate of supernovae. 
As mentioned in section 2.3, we assume that supernova rates are 



boosted by a factor of 5 for this work. Intuitively, this should in- 
crease the amount of supernova heating; however, we shall demon- 
strate that the resulting excess energies are quite insensitive to this 
parameter. All three parameters are kept constant in each simula- 
tion. 

We assume an initial gas fraction of 0.27 (section [l]). Unless 
stated otherwise, the resulting X-ray clusters have a mean gas frac- 
tion of 0.17 and a scatter of about 0.01, in agreement with the gas 
fraction used in the previous section. 



6.1.1 Setting esN 

The feedback parameter esN controls the amount of star formation, 
which can be characterized by the fraction of gas turned into stars 
by the present day. Using the Coma cluster as a large sample of 
baryons, the mass ratio of hot gas to stars inside a radiu s of 1.5/i~^ 
Mpc is about 15, assuming h — 0.5 (White et al. 1993). In order to 



match this, we set esN=0.3 for Model A and esN=0.25 for Model 
B. We find that the required value of esN is almost independent of 
the value of ro, unless tq takes an 'extreme' value (~ 10 times 
greater or smaller than 1). In fact, a much larger fraction of baryons 
is converted into BDM than into stars (as can be seen from the 
primordial and cluster gas fractions). Most of the BDM is formed 
in the halos of massive galaxies and small groups. 

6.1.2 Setting tq 

The parameter ro controls the transition from cold to hot collapse. 
From its definition, we know that ro ~ 1. However, we consider a 
range of values: ro = 1, 0.4, and an extremely low value of 0.1, to 
illustrate its effect on the resulting excess energies. Table|hststhe 
three sets of parameters used in the simulations. 



6.2 The excess energies from supernova heating 

For Model A, a scatter plot of excess energies vs. halo mass is dis- 
played in Fig. along with the mean and standard deviation for 
each mass bin. All of the scatter plots in this section were generated 
by randomly selecting up to 100 halos for each mass, regardless 
of redshift (only the most massive halos have less than 100 points 
plotted, because they are so rare). Up to a mass of ~ IO^^Mq, the 
excess energies clearly increase with mass. Above ~ 10^^ Mq, star 




Figure 17. Scatter plot of excess energy vs. halo mass, using Model A. 
Each mass bin contains a maximum of 100 points. Halos were selected 
randomly from the simulation regardless of redshift. The soHd hne gives the 
mean value of each mass bin, and the dotted lines are plotted one standard 
deviation from the mean. 
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Figure 18. As Fig. |l^, but showing the magnitude of the binding energy vs. 
halo mass. Note that the energy is now plotted logarithmically. 



formation gives way to cooling flow behaviour, so that the mean 
excess energy changes little. However, the scatter reduces signif- 
icantly due to an averaging effect. A gradual decrease in excess 
energy can be detected in the most massive halos, due to dilution 
by the accretion of primordial gas. 

The ratio of excess energy to binding energy gives a measure 
of the excess energy's ability to change the gas distribution. Recall 
that we define the binding energy to be equal to |-Bgas| in the ab- 
sence of heating. Fig. [l^ shows a corresponding plot of binding en- 
ergy for the same simulation as above. The ratio of excess energy to 
binding energy is displayed in Fig. [l^. It has a strict upper limit of 
1, above which gas halos are not bound. The distributions of points 
in mass bins below ~ are very similar and lie roughly in 

the range 0.2-0.6. The lowest mass bins are an exception because 
some of their halos have no excess energy at all; this causes the 
mean to dip for the lowest bins. As explained in the caption, this is 
purely an artifact of the finite mass resolution. 

The approximately scale-invariant behaviour below ^ 
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Figure 19. As Fig. ^ but showing the ratio of excess energy to binding 
energy. The dip in the mean for the lowest mass bins is caused by halos 
which have zero excess energy. This can only occur when a halo has no 
progenitors. Therefore, the dip is an artifact of the finite mass resolution. 
Notice that the finite points in the second lowest mass bin already have a 
similar distribution to higher mass bins. 
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Figure 20. The ratio of excess energy to binding energy obtained from 
Model B, using ro = 0.4 and egN=0.25. Note the strong similarity with 
Fig. [I 



can be understood as follows. Below a certain halo mass, 
almost all of the galaxies produce sufficient supernova feedback to 
eject their gas. In addition, the gas is always ejected with excess 
energy equal to the binding energy of the host halo (for the model 
assumes that the gas is ejected at the escape velocity). As a re- 
sult, for halos in, or slightly above, the said mass range, the ratio 
of excess energy to binding energy simply reflects the ratio of the 
binding energies of its progenitors to itself (ignoring the dilution 
of excess energy by primordial gas for simplicity). The similarity 
in the distribution of points in each mass bin (below ~ lO^^M©) 
simply implies that these ratios do not change much with mass. 

For halos > IO^^Mq, the ratio drops dramatically due to the 
cessation of star formation. Above IO^^Mq — in the regime of X- 
ray clusters — the excess energies have hardly any effect on the gas 
halos. 

Since the behaviour shown in Fig. is largely due to the bind- 
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Figure 21. The ratio of excess energy to binding energy obtained from 
Model B, using tq = 0.1 and egf,j=0.15. The lower value of ro causes 
star formation to cease at lower masses, resulting in a notable difference 
from Fig. 



ing energies of halos, it should depend little on the heating model. 
Fig. |2c| shows the corresponding plot for Model B, with the param- 
eters To = 0.4 and esN=0.25. As expected, it is almost the same as 
for Model A. However, a difference does occur if ro is reduced fur- 
ther. For Fig. |2l|, we used the parameters ro = 0.1 and esN=0.15 
with Model B. In this case, star formation is restricted to much 
smaller halos, so that the decline is also shifted to a lower mass 
scale. (A side effect is that more gas is lost in cooling flows, so that 
the gas fraction of clusters is 0.15 instead of 0.17). This scenario 
is unlikely to occur in reality, not only because we expect ro ~ 1, 
but also because the characteristic luminosity, L,, of the luminosity 
function of galaxies (fitted with a Schechter function) would be too 
small. 



6.3 More on heating clusters with supernovae 

The excess energies obtained above are clearly too low to satisfy 
the energy requirements of X-ray clusters (section The relation- 
ship between excess energies and binding energies also suggests 
that it would be difficult to increase the amount of heating signif- 
icantly in this model. Indeed, we find that the excess energies of 
clusters are not sensitive to esN, nor the supernovae rate per unit 
star formation. For example the parameters esN=O.I and esN=l-0, 
used with Model A, give virtually identical excess energies in clus- 
ters to those shown in Fig. |l7| — indeed, the rest of the plot is hardly 
modified. If instead we remove the factor-of-5 boost in supernova 
rates (implying a change in the IMF), the excess energies of clusters 
are only reduced from around 0.05 to 0.03 keV/particle. 

Expanding on the previous section, the excess energy of a 
cluster is essentially determined by the binding energies of the 
most massive progenitors in its merger tree (looking backwards in 
time along each and every branch) to produce type II supernovae. 
Although these progenitors might not be able to eject their atmo- 
spheres, they are still likely to leave the gas with iScxcoss close to 
the binding energy. Furthermore, the extent to which JSexcess is di- 
luted by primordial gas in the final cluster is also mainly a function 
of the merger tree. The net result is that changing esN or the IMF 
has little effect on the excess energy of clusters. What they do af- 
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feet is the amount of gas converted into stars: tiie more efficient tlie 
supernova feedback, the less stars are formed. 

The above suggests that we can increase the excess energies 
of clusters by increasing tq. We find that by increasing to from 1 
to 3, the transition from star formation to cooling flow behaviour is 
shifted to halos that are roughly 4 times more massive. As a result, 
-Eexcesa in clustcrs increases from around 0.05 to 0. 12 keV/particle. 
This agrees very well with a simple scaling argument: since bind- 
ing energy scales roughly as M'^^^ , the 4-fold increase in the mass 
scale of the transition region implies that iScxccss should increase 
by a factor of 4^^^; this is indeed the case, but the increase is clearly 
too small. 

6.3.1 The simulated iron abundances 

The clusters shown in Figs. [l|and|| have an iron abundance of 
about Q.Q^ Zp). Although this is lo wer than the observed range of 
0.2-0.3^0 ( [pukazawa et al. 1998[ ), we reiterate that this is not the 
reason for their low excess energies. Like the gas-to-stellar mass 
ratio, the iron abundance can be controlled by the parameter esN- 
For example, reducing esN by a factor of 3 increases both the stellar 
mass and the iron abundance of clusters by about a factor of 3. 

The large number of type II supernovae per unit stellar mass 
required to enrich cluster ga s to the observed me tallicities has been 
discussed by other authors dAmaud et al. I992|; Elbaz, Amaud, & 
Vangioni- Flam 1995; [Brighenti & Mathews I999| ). It is possible 
that a large fraction of the iron in clus ter gas is due to type la s uper- 
novae, which we have not included. Vagataki & Sato (1998) sug- 
gest that between 30-90 per cent of the iron in X-ray clusters may 
be due to type la supernovae. It is also possible that the observed 
metallicities (which are emission-weighted) overestimate the aver- 
age metallicities of cluster gas, due to the existence of stee p metal- 
licity gradients (Ezawa et al. 1997; Allen & Fabian 1998h| ). 



(ii) If the gas pressure outside the proto-halo is raised signifi- 
cantly due to heating, then the work it does on the proto-halo may 
need to be included. 

(iii) In any progenitor halo that contains hot gas, work is done 
(by the gas remaining) on gas that cools out near the centre. This 
has the effect of reducing iSoxccss . 

(iv) Gas that is converted to stars and BDM is generally located 
in positions of minimum potential. Removal of this gas may there- 
fore increase the mean energy of the gas that remains. 

The mechanisms have been listed in order of increasing sophisti- 
cation in the arguments required. We consider each of them below 
and attempt to quantify their effects on -Ecxccss- We also give a 



more formal definition of Ec- 



; and discuss the evolution of Eg, 



in some detail. For definiteness, we shall base our discussion on the 
proto-halo of a cluster, but it can be generalised to smaller halos. 

Quite aside from the effects mentioned above, there remains 
the possibility that when the excess energy is large, some of the gas 
associated with a DM halo may extend beyond the virial radius. 
Also, there is some uncertainty in the efficiency with which gas that 
is ejected from a halo recollapses into larger halos. We assumed that 
such effects are small in our model. 

If the heating of proto-cluster gas is very uneven, e.g. if the gas 
is heated by the radio jets of AGN, then the main effect may be to 
unbind part of the intracluster medium. In this case, smaller clusters 
would have lower gas fractions than larger clusters. However, in 
order to match the observed Lx — T relation, the excess energies 
would still need to be very high. The X-ray luminosity of a 2 keV 
cluster is an order of magnitude below the self-similar prediction 
(see e.g. Fig. 1 of WFN98). Since Lx scales as the gas density 
squared, we would need to unbind 2/3 of the gas to reduce Lx by 
an order of magnitude (assuming that the shape of the gas density 
profile remains unchanged). The excess energy averaged over all of 
the gas is then ~ 2/3 of the binding energy of the cluster. 



7 LIMITATIONS OF THE MODEL 



7.1 The 'gravitational' contribution 



In our model, we make the approximation that the excess specific 
energy of a gas halo is equal to the total energy injected over the 
history of the gas. i.e. 



E, 



VAVAt, 



(12) 



where Mgas is the mass of the gas halo and T is the net heating 
rate per unit volume. In general, V thus includes heating by super- 
novae and active galactic nuclei, and accounts for the energy lost 
through radiative cooling. We refer to F simply as the rate of non- 
gravitational heating. The volume integration is made over all of the 
gas that eventually forms the gas halo, therefore the volume itself 
is irregular and varies with time. 

However, there are mechanisms other than V that can affect 
the final value of -Bcxccss and hence warrant at least a mention. 
In what follows, we shall consider a single halo and the evolution 
leading up to its virialization. We use the term 'proto-halo' to refer 
to the contents of this halo at all times earlier than the virialization 
time (note that the proto-halo is not itself a halo, but it can contain 
progenitor halos). 

Briefly, the mechanisms are as follows. 

(i) If the evolution of the gas distribution (which otherwise 
traces the DM distribution fairly well) is modified significantly by 
non-gravitational processes, then there can be a 'gravitational con- 
tribution' to iScxccss- 



We begin with a simplified scenario in which no gas is converted 
into stars or BDM in the proto-halo. We generalize the definition of 
_Egas (equation ^ to apply to the proto-halo at any time, by includ- 
ing the kinetic energy of bulk motion: 



E, 



Ps 



(13) 



where v is the velocity of the gas and the volume of integration 
is as explained above. At early times, the proto-halo occupies a 
roughly spherical region; it later condenses into sheets, filaments 
and halos. As a first approximation, the potential can therefore 
be calculated from the mass distribution of the proto-halo, ignoring 
all matter outside it. (Using a larger region to calculate (j) does not 
affect our argument, but this simplifies estimates of £'gas(t).) We 

set </> = at infinity. 

In Appendix B (equation B 12 1 we show that -Egas obeys 



1 



dt 



dt 



(14) 



where we have assumed that the gas pressure at the boundary of the 
proto-halo is negligible. This implies that the rate of change of iJgas 
is given by the net rate of non-gravitational heating, plus a weighted 
average of Since (j) is dominated by the contribution from DM, 
we shall make the approximation throughout that 4> is unchanged 
by modifications in the gas distribution. This leads to the important 
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Figure 22. A schematic diagram of the evolution of Bgas with time. The 
times tf^,ta and give the time of energy injection, the turnaround time 
and the virialization time respectively. The lowest curve gives the evolution 
ofE gas in the absence of non-gravitational heating or cooling. -Ecxccss is 
defined as the deviation from this curve at i„. The other two solid curves 
show the effects of injecting a small and large amount of energy. In the 
latter case, there is a 'gravitational contribution' to the excess energy, given 
by the difference between the solid and dashed curves at f„ . 



observation that the gas processes which drive F do not have an 
immediate effect on the other, gravitational term. (This would not 
be the case if, for instance, that term included instead of pg.) 
This allows us to consider the two terms on the rhs separately. 

In the absence of non-gravitational processes (implying F — 
0), we expect iJgas to increase as the system expands, and to de- 
crease after the turnaround time, ta- The final value of -Egas, at 
the virialization time ty, is given by equation i in our model. A 
schematic diagram of this is shown in Fig. ^ . Equation ^ itself 
simply expresses how we expect iJgas to scale in the absence of 
non-gravitational processes. 

The formal definition of iScxccss is thus the difference between 



the actual value of iSgas at t„ and the value obtained in the absence 
of non-gravitational processes. Now suppose the inclusion of non- 
gravitational heating does not modify the gas distribution at all. 
In this case, the gravitational term in equation |l^ is not affected. 
i?oxcess is then given by equation [l^ exactly. This is illustrated in 
Fig. by a single, small injection of energy at time th - The subse- 
quent evolution of _Egas is unchanged. 

If the energy injected is large (comparable to |i5gas|), then it 
can make the gas distribution more extended in the potential well of 
the proto-halo. This is likely to reduce the magnitude of the grav- 
itational term in equation 1141, because more weight is given to ar- 
eas of smaller j<^j, where \d(f)/dt\ is also likely to be smaller. The 
net change in iSgas between energy injection and tv is therefore 
reduced. This is illustrated by the solid curve resulting from the 
large injection of energy in Fig. ^ Its deviation from the dashed 
curve (which would describe Bgas if the gas distribution were not 
modified) leads to an excess energy that is larger than the energy 
originally injected. We refer to the difference between the solid and 
dashed curves at t„ as the 'gravitational contribution' to iScxccss- In 
general, the gravitational contribution is given by 



(Pg 



Pi,,G)^dVdt. 



(15) 



The above argument suggests that provided -Egas,G(^h) > 
£'gas,G(iti), the gravitational contribution associated with a large 
injection of energy is likely to be positive, especially if the above 
inequality is large. From Fig. we can see that this is no longer 
true if th is earlier than some time ti, given by i?gas,G(ii) = 
Eg!is,G{tv)- However, a rough estimate of ti gives ti = 0.09t„ 
(using the spherical collapse model and assuming that the radius 
of the system at ti is equal to the virial radius). It therefore seems 
likely that most of the heating would occur after ti. 

In general, the gravitational contribution to -Ecxccss is difficult 
to estimate, and would require modelling with hydrodynamic sim- 
ulations. It would depend on the total energy injected, and when it 
was injected (on average). It would probably be stochastic as well. 
This is reminiscent of the 'bias parameter' used to relate the clus- 
tering of galaxies to the clustering of DM, suggesting that perhaps 
the exact value of -Ecxccss can be related to the energy injected by 
some bias parameter. 

From Fig. ^ the maximum possible gravitational contribu- 
tion would in principle be given by injecting sufficient energy at ta 
to raise -Egas to almost zero. Assuming that the gas so dispersed 
that ^ ~ 0, the gravitational term in equation |l4 then vanishes. 
Hence, _Egas ~ at , and Ecxccss would be greater than the en- 
ergy injected by Emax ~ [i^gas.c]?"- The axis of Egas in Fig. H 
has been marked with intervals of E^max, according to rough es- 
timates of the absolute values of Sgas.o at ta and tv (derived in 
Appendix C). In this idealized example, E^cxccss is therefore ~ 50 
per cent greater than the energy injected. Such an increase could 
assist in breaking the self-similarity of clusters. 

7.2 Work done at the outer boundary 

If gas in the proto-halo does work on gas outside, we would ex- 
pect this to reduce E^gas, and vice versa. Conceptually this is quite 
simple, but we need to follow the gas in more detail than before. 
We introduce the term 'proto-gas halo' to refer strictly to the gas 
which eventually forms the gas halo. (Thus it does not include gas 
that is converted into stars or BDM before virialization. We do not 
explicitly account for gas that is recycled from stars, which should 
be only a small fraction of the intracluster medium.) The mass of 
the proto-gas halo is thus constant with time. 

To distinguish this from our earlier discussion, we introduce 
egas to give the total specific energy at a position comoving with 
the gas: 



3kT 1 2 
2fimu 2 



(16) 



Integrating this over the mass of the proto-gas halo gives the total 
energy, J Cgasdm = MgasE^gas- In Appendix B, we show that 



d 
dt 



J egasdm = J (-Pv+Tv)-dA+ J (^p^^+T 



dv,(n) 



Here and below, we use a subscript 'G' to imply the same system 
evolved without including non-gravitational processes. 



The only change from equation [l^ is the additional surface inte- 
gral, in which P is the gas pressure, T is the viscous stress tensor 
and dA is a vector element of surface area. The surface integral 
gives the rate of work done by the proto-gas halo on other gas. The 
viscous term is almost certainly negligible for our purposes, so we 
assume that it vanishes. The work done at the outer boundary of the 
proto-gas halo is then a straightforward integral of PdV. 

Part of the motivation for estimating this is because, if the gas 
is heated when it is diffuse, then it is conceivable that the work done 
by compressing the proto-gas halo would boost the final excess en- 
ergy (due to the increased pressure). Note that it is the change in 
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work done as a result of heating that we are interested in. Since 
hydrodynamic simulations which do not include non-gravitational 
processes result in almost self-similar X-ray clusters, we can be as- 
sured that any work done does not prevent them from following a 
self-similar energy equation such as (Q). For simplicity, we shall 
consider the total work done after the turnaround time, to see if this 
effect can increase iSoxcoss • 

We expect most of the work to occur on those parts of the outer 
boundary which form the ends of filaments and, possibly, the edges 
of sheets, because density and temperature are highest at these sur- 
faces. Although the rest of the outer boundary has a much larger 
area, we shall assume that the pressure there is so small that the 
work done there is no more than that at the end of filaments. For 
filaments, the volume swept out by the end surfaces should be com- 
parable to the volume of the filaments. This is because infall occurs 
along the filaments in general. Using the spherical collapse model 
for comparison, the volume of the sphere at turnaround is 8V200, 
where V200 is the volume of the virialized halo. Let there be an ef- 
fective pressure P^s, then the work done on the proto-gas halo be- 
tween the turnaround time and virialization is 7/eff V2oo-Peff, where 
7/cfiV2oo is the effective volume swept out by the said surfaces. 
Letting Pcff = Pg,cHkTcs/{fi7nn), where we have defined an ef- 
fective density and temperature, the work done is then given by 



7/efI 1^200 — 7 Jeff fcicff 

/iTTiH Pg \ fJ-mn 



(18) 



where Pg is the mean density of the virialized gas halo and 
Mga_s/ f^rnu is the number of particles in the gas halo. It follows 
that the contribution to _Egas is given by 



7/cft^f^)keV/particle. 
Pg \ keV / 



(19) 



The volume filling factor of filaments, which we use to ap- 
proximate /cff, naturally depends on the threshold density above 
which we define our filaments. From hydrodynamic simulations of 
the IGM in a Cold Dark Matter Q — 1 cosmology (Zhang et al. 
1999), threshold overdensities of about 1 to 5 (relative to the back- 
ground baryon density) result in filamentary structures, but higher 
than ~ 10, the structures obtained become dominated by knots 
rather than filaments. Most of the filamentary structures also ap- 
pear to be in place by z — 5, and exhibit mild evolution after that 
(Zhang et al. 1998). We shall use a fiducial value of fcs ~ 0.01 



and a fiducial overdensity of 10. In an f2 = 1 universe is 200 
times the background density. In the simulation, the filaments have 
a typical temperature of ~ 10"'^ keV. We thus obtain a fiducial 
value of 7 X 0.01 x 0.05 x 10"^ = 3.5 x 10^'' keV/particle for 
the work done in the absence of heating. This is clearly negligible. 

If all of the gas in the filaments is heated strongly to a temper- 
ature of ~ 1 keV, then the gas halos within would also be flushed 
out. This would momentarily increase the effective density of the 
filaments. Substituting the values /9g,cff/pg = 1/3 and kTcft — 1 
keV, the work done becomes 0.02 keV/particle. In reality, the gas 
would continue expanding out of the filaments, so that the volume 
filling factor /eft would increase. Assuming that the gas expands 
adiabatically, Pg.effToft oc P^s oc 1//^, where 7 — 5/3. This 
suggests that the actual work done would be less than the above es- 
timate, and therefore <^ 1 keV/particle. The caveat is that we have 
not accounted for gas that is more diffuse than the filaments, which 
may also be heated to ~ 1 keV/particle. 



7.3 Work done on hot gas that cools 

In this and the following section, we consider how the conversion 
of gas into stars and BDM inside the proto-halo may affect the total 
energy of the gas remaining. 

If a progenitor halo contains hot gas which cools out, then 
work may be done by the proto-gas halo on the gas that cools. (In 
cold collapses this should be very small, as the gas is in general 
not pressure-supported.) This would reduce the total energy of the 
proto-gas halo. However, the gas remaining started out at larger 
radii; therefore it had a higher than average potential energy before 
cooling started. This effect is discussed more generally in the next 
section, and it works in the opposite direction. The net effect can 
be investigated with a spherical hydrodynamic simulation of a hot 
gas halo which cools. 

Here, we describe a simple way to obtain an upper limit on the 
work done, using our simulations. Suppose the proto-gas halo has 
an inner surface or 'bubble' that lies inside some progenitor halo 
that contains hot gas (the 'bubble' is likely to quite irregular). Let 
Pg and T be the density and temperature at this surface. Then the 
work done as the bubble shrinks is J PdV — J kT/ (/.imn) PgdV. 
We shall assume that the gas halo is isothermal. Now, PgdV at the 
bubble wall is smaller than the mass of the corresponding gas that 
cools out, because the latter had an initial density greater than pg. 
It follows that if we replace PgdV in the integral with dm, the 
mass of gas that cools, then we would overestimate the work done. 
Hence kTmBDM/ifJ-mu) gives an upper limit on the total work 
done, where ttibdm is the mass of hot gas converted into BDM in 
our simulations. (Note that it does not actually matter whether the 
gas is converted into BDM, stars, or a cold disk.) 

However, if all of the hot gas in the progenitor halo cools out, 
then the 'bubble wall' must lie outside the gas halo, where the pres- 
sure is probably negligible. This suggests that we should not count 
such cases at all. 

Over the history of the proto-gas halo, the total work done 
on hot gas that cools is thus < ^ kTmBDM/{^^rnH), where the 
summation is made over all progenitor halos that did not cool out 
all of their hot gas. The reduction in -Bgas is therefore less than 



1 



^msDM ) keV/particle. 



(20) 



We computed this quantity using Model B (i.e. only isothermal gas 
profiles) and both sets of parameters given in Table ^ For small 
clusters (T « 2 keV), we obtain around 0.25 keV/particle, with a 
scatter of 50 per cent each way. For large clusters (T ~ 10 keV) the 
upper limit is about double this. The two simulations gave similar 
results. 

[We note that the bubble is just an imaginary surface for sepa- 
rating different subsets of gas. If heating (in the form of F) occurs 
inside a bubble, gas outside can still be heated via the surface term 
in equation |l^. For most purposes, the distinction is best ignored.] 

7.4 The effect of gas removal 

Having developed the machinery to follow clumps of gas individu- 
ally, it is natural to ask whether the spatial distribution of the proto- 
gas halo can itself result in excess energy. This becomes clear if 
we consider the proto-gas halo at very early times. Its outer bound- 
ary is then almost spherical, but it would contain many 'bubbles' 
inside, as described above. If the bubbles occur preferentially to- 
wards the centre of the sphere, then the gas would have positive ex- 
cess energy, because fractionally more gas would be found at larger 
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radii and higher potentials than in a uniform distribution. Again, 
we are comparing to the case without non-gravitational processes, 
for which the proto-gas halo is just a uniform sphere at very early 
times. In section 8.4, we suggest how a positive excess energy can 



occur in this way, and make simple estimates of its magnitude. 

To estimate the excess energy, it is easier to make compar- 
isons when the halo has virialized, because the time evolution of 
J egasdm is complicated. Consider a virialized gas halo obtained 
without cooling: only a subset of its gas particles would remain 
in the gas halo if the system were evolved with cooling included. 
If this subset has a more extended distribution than the entire gas 
halo, then the subset would have a positive excess specific energy. 
Assuming that the gas is isothermal for simplicity, -Ecxccss can be 
estimated by comparing iJgas for the subset with that for the entire 
gas halo. 

We note that the above method actually overestimates -Ecxccss, 
for it does not account for the work done on the cooling gas, and it 
probably overestimates the gravitational contribution in the follow- 
ing way. Since we compute i?gas after virialization, the evolution 
of / Bgasdm for the subset of gas particles is already accounted 
for. Since the subset is more extended than the whole, there is 
likely to be a positive gravitational contribution. However, in real- 
ity gas belonging to the subset would gradually fall to smaller radii 
to replace cooled gas. The method does not account for this and 
therefore overestimates the gravitational contribution. If a hydro- 
dynamic simulation of a cluster is performed with cooling, all these 
effects would be naturally accounted for. In this case, iJcxccss could 
be computed exactly by comparing with the same cluster evolved 
without cooling. 



8 BREAKING THE SELF-SIMILARITY OF CLUSTERS 

In section|5[ we showed that excess energies of about 1 keV/particle 
or more are required to match the properties of X-ray clus- 
ters. However, we found that our model generates only ~ 0.1 
keV/particle from supernova heating. Nevertheless, the excess en- 
ergy deduced from the iron abundance of X-ray clusters can be as 
high as 1 keV/particle (WFN98). To obtain this result, we made 
two crucial assumptions: that most of the iron originated from type 
II supemovae (SNII), and that a large fraction of the supernova 
energy — we assumed 4 x 10^" erg per supernova — is retained. 

Unfortunately, the first assumption is already in doubt. A re- 
cent analysis suggests that SNIa supply 30-90 per cent of the iron 
in cl usters, depending on the supernova model (N agataki & Sato 
1998). Recall that the same amount of iron contributed by SNIa 
corresponds to ~ 10 times less energy. As for the supernova en- 
ergy that is retained, Thornton et al. (1998) have made a system- 



atic study of supernovae exploding in cold gas (1000 K) in a range 
of gas densities and metallicities. They find that in the late stages 
of evolution, the supernova remnants have total energies of about 
(9-30) X 10'*^ erg (they assumed initial energies of 10^^ erg per 
supernova). We note that if the supernova rate is sufficiently high 
that remnants overlap before going radiative, then the heating ef- 
ficiency may be higher in reality. We are therefore unable to rule 
out supernovae as the source of the required energy, based on the 
present data. However, it is our opinion that this scenario is only 
marginally acceptable. 

The purpose of this section is to move beyond the confines of 
our model, and discuss other possible approaches to breaking the 
self-similarity of clusters. 



8.1 Supernova heating 

Assuming that all the excess energy can be provided by supemovae, 
we consider the basic properties that such a model would need to 
have. First of all, it is clear that a large fraction of the iron in clus- 
ters would have to come from SNII. To obtain enough supernovae, 
most of the stars observed in present day clusters would need to 
be formed with a flattened IMF: for example, boosting the stan- 
dard supernova rate by a factor of 5, and assuming a gas-to-stellar 
mass ratio of 15 (the same parameters as in section n), gives an 
iron abundance of Zsnii = O.ISZ© provided all of the iron is de- 
posited in the intracluster gas. This corresponds to 1 keV/particIe if 
we set esN=l-8. (Since this is already very high for esN, we would 
not want Zsmi to be much lower.) Note that practically all SNII 
would have to have such a high heating efficiency, therefore most 
star-forming galaxies would have to be involved in the heating pro- 
cess. 

We showed in section ^ that the main obstacle to obtaining 
higher excess energies in our model was the assumption that gas is 
ejected at the escape velocity of host halos. In order for gas heated 
by supernovae to escape from a halo with much greater than the 
escape energy, it needs to find a clear path out of the halo. Unfortu- 
nately, this is difficult if the site of star formation is surrounded by 
a hot gas halo, and a continuous infall of cooling gas may also be 
problematic. If a clear path is not found, then gas surrounding the 
site of star formation would be heated gradually; it would leave the 
heat source as soon as it had sufficient energy to escape the halo, 
hence it would be ejected with no more than the escape energy. 
Since halos > lO^^M© generally contain hot gas, we shall sup- 
pose that the heating occur red in less massive halos. In a ddition, we 
show in a separate paper (Wu, Fabian, & Nulsen 1999) that most 



of the gas associated with halos in the range ~ 5 x IO'^^-IO^Mq 
must lie outside their virial radii, therefore it is clear that at least 
some of the heating must have occurred in less massive halos. The 
gas would therefore be ejected with excess energies > 10 times the 
binding energy of these halos (see Fig. |^. To do this, supernovae 
would need to carve out 'chimneys' in the surrounding gas, for the 
hot gas to escape from. This could be made easier by delaying star 
formation until most of the gas has settled into a cold disk, e.g. by 
magnetic pressure, turbulence and/or angular momentum support. 

To keep radiative loss to a minimum, gas needs to be rapidly 
heated to very high temperatures (~ 1 keV) and then ejected. Gas 
that is not ejected must not receive much of the energy, as the cool- 
ing times of galaxies are relatively short. There is also the problem 
of dilution: if only a certain fraction of the intracluster medium is 
heated in this way, then gas would need to be ejected with corre- 
spondingly higher excess energies. 

A side effect of this scenario is that the ability of supernovae 
to regulate star formation would be greatly diminished. Since most 
of the energy must be channelled into gas that is ejected, the main 
role of supernovae is to regulate the quantity of cold gas that is left 
in the halo. However, for the same amount of supernova energy, 
the amount of gas that is ejected is < 1/10 of that in a more con- 
ventional model which assumes gas is ejected at about the escape 
velocity. Therefore, another more effective regulator of star forma- 
tion would be required. Otherwise, the bulk of star formation would 
occur in the smallest halos, and more massive halos would become 
very gas-deficient. One possibility would be to assign a long time- 
scale for star formation in cold ga s, following other SAMs (K auff- 
mann. White, & Guiderdoni 1993; |Cole et al. 1994|; Somerville & 
Primack 1998), though the published time-scales would need to be 
increased. 
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Is it possible for supernova heating to continue in tlie hot gas 
halos of groups? Observations suggest that 10-20 per cent of cold 
gas deposited in cooling ^owsjna^fonn_stars_(Car Gorgas, & 
Aragon-Salamanca 1998). In addition, the binding energies of ha- 
los > IO^^Mq in mass are around 1 keV/particle or more, so it 
would be possible to reach the required excess energies in these ha- 
los without ejecting their gas. We shall briefly discuss some of the 
difficulties with this model. Firstly, gro ups are gas-poor compared 
to clusters ( Wu, Fabian, & Nulsen 1999| ), so it is unclear how much 
gas would be available to form stars, especially as the cooling times 
are increased by the low gas density. Secondly, gas cools and forms 
stars gradually in a cooling flow, so that supernovae would heat 
surrounding gas which is in the process of cooling. It is therefore 
unclear how efficiently supernovae can heat the gas that does not 
cool, since the heating may simply slow down the cooling flow. 

In any case, the model described is already tightly constrained 
by the present data, so it can be tested in several ways with future 
observations. Spatially-resolved spectral analysis will allow us to 
properly measure the average metallicity of intracluster gas. Most 
of the present measurements are emission-weighted, which would 
overestimate the average metallicity if a negative metallicity gra- 
dient is present. Better estimates of the SNIa contribution to the 
iron abundance may also rule out the above model. Our own re- 
sults suggest that if X-ray clusters with T ~ 2 keV turn out to be 
isothermal, then their excess energy should be about 2 keV/particle, 
instead of 1 keV/particle (see Fig. |^. Spatially-resolved temper- 
ature and density profiles would therefore further constrain the en- 
ergy requirements and therefore the models that are allowed. 

The above discu ssion may be altered if hypernovae releas- 



ing ~ 10 erg each (Iwamoto et al. 1998) were common. Since 



the progenitors of hypernovae are believed to be stars of mass 
> 40Mq, such a scenario would still require an IMF strongly bi- 
ased towards very massive stars. 



8.2 Pre-coUapse gas at high entropy 

Thus far, we have used the total energy of a gas halo as the main 
constraint on its structure. In this section we shall discuss a differ- 
ent constraint, namely the gas entropy, which we measure with the 
quantity s — T /v?J^ . 



It was proposed by Kaiser (1991) and Evrard & Henry (1991) 
that a better match to the Lx — T relation could be obtained if 
the I GM was 'preheated' to a high en tropy prior to collapse of the 



gas. Navarro, Frenk, & White (1995) used this method in hydro- 
dynamic simulations of three clusters, using a gas fraction of 0.1 
in an Einstein-de-Sitter universe. By giving all gas particles a uni- 
form high entropy at a redshift of z = 3 (no radiative cooling was 
included), they were able to obtain clusters that closely followed 
Lx oc T^. 

More recently, Ponman et al. (1999) have measured the en- 
tropy of gas in clusters at one tenth of the virial radius (to avoid 
possible cooling flow regions) and found that the entropies mea- 
sured in poor clusters and groups were higher than predicted as- 
suming self-similarity. Instead, the entropies appeared to settle on 
a lower limit or 'floor' given by T/ne ^ 100/i ' keV cm . 
This suggested that perhaps all of the gas had been preheated to 
this entropy, so that outside any cooling region, the entropy would 
have at l east this value (since shock hea ting always increases the 
entropy). Balogh, Babul, & Patton (1999| ) investigated this idea by 
assuming that the preheated gas evolves adiabatically. Using an ini- 
tial entropy consistent with the observed 'entropy floor', they found 



that the isentropic model could fit the properties of groups (T < 1 
keV) but could not match the properties of clusters. This was at- 
tributed to the need for accretion shocks to raise the entropy further 
in clusters. 

We have stressed that clusters need to have sufficient excess 
energy in order to match the Lx — T relation. We therefore argue 
that preheating the gas to an entropy floor alone would not solve 
the problem unless the excess energy is sufficiently high. However, 
this may turn out to be a superfluous point, since creating an en- 
tropy floor probably requires large amounts of energy anyway (see 
below). We also note from Fig. [l^ that the gas profile which re- 
quires the least excess energy to match the data also has the highest 
entropy near the centre. Therefore it clearly helps if we try to raise 
the entropy as high as possible. 

The energy required to raise the entropy to a certain level de- 
pends very much on the density of the gas. Since the IGM is very 
diffuse in places, it seems that a relatively high entropy can be 
achieved with very little energy. However, the difficulty is heating 
most of the gas in the proto-cluster in this way, especially the gas 
which eventually forms the core of the cluster. This is because the 
minimum density experienced by the gas is limited by the overden- 
sity that led to the cluster in the first place (as well as smaller scale 
density fluctuations). The spherical collapse model gives a simple 
illustration of this constraint. 

To estimate the 'advantage' of heating the gas at low density, 
we need to compare the density at the time of heating to the final 
density of the gas, i.e. we need to estimate the compression ratio. 
Using the spherical collapse model, the turnaround radius of the 
sphere is twice the final virial radius, so that the mean density of 
the sphere has a minimum equal to 1/8 of the mean density of the 
virialized halo. This simple model suggests that adiabatic compres- 
sion can increase the temperature of preheated gas by a factor of 4 
at most. 

Alternatively, we can compare the minimum density obtained 
above to a fiducial density of ric — IQ^^ cm^"^ near the centre of a 
cluster (above which cooling can significantly modify the entropy 
during the life of the cluster). The mean gas density at turnaround 
for a halo that collapses at time t is given 

where /gas is the gas fraction. This implies an electron density 
of no,min = 2.0 X 10~Vo.2/t?o cm"^ where tw = t/{10^° 
years) and /0.2 = /gas/0.2. Thus the temperature increase when 
this gas is compressed to density Ue = 10""* cm~^ is a factor of 
13tto^(?i-3//o.2)^''^, where n_3 = ne/(10~^ cm*^). In reality, 
this factor is much reduced by clumping of the gas into filaments 
and sheets by the turnaround time, so that typical value of nc,min 
should be much higher than we have estimated. In addition, the 
central region of the cluster, with roughly the fiducial density of 
Tic = 10~^ cm~'^, almost certainly virialized at an earlier time as 
a less massive halo. If this virialization time is used in tio and fil- 
amentary nature of the gas at turnaround is accounted for, then the 
temperature increase due to adiabatic compression is only a factor 
of a few. 

To obtain more precise estimates, we need to use a hydrody- 
namic simulation. Assuming that the temperature increase is a fac- 
tor of 4, the isentropic profile in Fig. hq which requires the least 
excess energy has a temperature of just over 3 keV at 0.1r2oo. This 
would therefore imply a temperature of (3/4) keV before compres- 
sion, or a thermal energy of (9/8) keV/particle. 
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8.3 Heating by active galactic nuclei 

Energetically speaking, the total energy released in the formation of 
massive black holes at the centres of galaxies is sufficient to heat all 
the baryons in the universe to very high excess energies. However, 
the mechanism for injecting this energy into the gas is uncertain: 
this may occur through jets and winds, but the energy released in 
this form is not well known. On the other hand, the energy released 
as radiation is relatively w ell measured. 

tinsslm et al. have estimated the total energy released 

by black hole formation in the Coma cluster They assumed a mass- 
to-light conversion rate of e « 0.1 and roughly the same rate of 
energy release in relativistic particles and magnetic fields (as in the 
jets of radio galaxies). They concluded that the total energy released 
in the latter form was comparable to the thermal energy of the gas 
in the Coma cluster Therefore, if all of this energy was injected 
into the gas, it could significantly modify the gas distribution of the 
cluster 

It is possible to make a similar estimate by averaging over all 
the baryons in the universe. From the observed luminosity density 
of AGN, the total mass density of black holes in the universe can 
be determined. Using the X-ray background intensity at 30 keV, 
Fabian & Iwasawa (1999) obtain a range of 6-9 x IO^Mq Mpc~^ 
for the black hole density. [This is h igher than earlier estimates 
( [Soltan 1982 ; Chokshi & Turner 1992), which used optical counts 
of AGN. It is likely that these counts suffered from strong intrinsic 
absorption (Fabian et al. 1998).] Assuming a mass-to-light conver- 



sion rate of 0. 1 and a black hole density of 6 x 10 M© Mpc~ , the 
total energy radiated by AGN is then 6.4 x 10^* erg Mpc~^. If the 
same amount of energy is available in relativistic particles and mag- 
netic fields, and it is divided uniformly over all the baryons in the 
universe, we would obtain an energy injection of 3.7 keV/particle. 
As before, we have assumed fij, = 0.08 and h = 0.5. This amount 
of heating would therefore be more than enough to break the self- 
similarity of clusters. 

On the downside, we note that only about 10 per cent of AGN 
have observed radio jets. If such jets provide the only mechanism 
for AGN to heat surrounding gas, then the estimated excess en- 
ergy would be correspondingly reduced. However, it is possible that 
radio-quiet quasars may also heat surrounding gas through outflows 
of thermal gas or poorly-coU imated 'jets' of radio-emitting plasma 
dKuncic I999|; [Fabian \99%. 



The advantage of this form of heating over supernova heat- 
ing is that it need not be intimately connected with the process of 
star formation. By obtaining the required energy from AGN, su- 
pernovae would be able to perform their usual role as regulators of 
star formation (see above). In addition, since an AGN is a single 
powerful source of energy, the gas being heated is more likely to be 
raised quickly to a very high temperature (> 1 keV). In this case, 
the cooling times would be comparable to those of X-ray clusters 
and radiative loss would be minimised. 



8.4 Preferential removal of gas 

As explained in section [7.4| . it is possible that the removal of cooled 
gas can result in an excess specific energy in the gas that remains 
to form the intracluster medium. 

The excess energy can be estimated from the subset of gas 
particles, in a cluster evolved without non-gravitational processes, 
which would remain in the gas halo if radiative cooling is included. 
If the subset has a more extended distribution than the entire gas 
halo, then a positive excess energy would result. This would occur 



if gas at smaller radii had a higher probability of cooling out than 
gas at larger radii. 

Such a scenario may occur as follows. Theory predicts that the 
first halos of a given mass to collapse should be much more strongly 
clustered than the background density distribution (Kaiser 1984). 
For instance, the large-scale over-densities that created present-day 
clusters also raised the overall density of smaller-scale fluctuations, 
so that the first galaxy halos to collapse had a high probability of 
being associated with future clusters. The above has been used to 
explain the strong clustering of 'Lyman-break galaxies' (LBG) ob- 
served at 2 ~ 3 ( ISteidel et al. 19981 lAdelberger et al. 1998( Gi- 
avalisco et al. 1998), where good agreement with theoretical pre- 
dictions have been obtained if the typical LBG is associated with a 
halo of mass ^ IO^^Mq. N-body simulations show that the densest 
peaks in the dis tribution of LBGs are l i kely to be the progen itors of 



future clusters (Governato et al. 1998; Wechsler et al. 1998). If we 



make the reasonable assumption that the large-scale over-density 
that led to a cluster was highest near the centre, then it seems likely 
that the LBGs would form preferentially near the centre of the clus- 
ter Naturally, as more of the proto-cluster goes non-linear, galax- 
ies would become more uniformly distributed in the proto-cluster 
Nevertheless, the first sub-halos of a given mass to collapse also 
have the highest mean gas density, so that they have the shortest 
cooling times. Hence gas is more likely to cool out, and be re- 
moved, near the centre of the cluster 

To obtain an upper bound on the excess energy obtainable 
in this way, we modelled the virialized cluster (in the no-cooling 
case) with singular isothermal spheres (p oc r~^) for both the 
gas and dark matter. Assuming a primordial baryon fraction of 
0.27, a cluster gas fraction of 0.17 is obtained if we remove all 
of the gas inside a radius of (10/27)r2oo in the above gas dis- 
tribution (recall that this amount of cooling was obtained in our 
simulations). The difference in iJgas before and after the gas is re- 
moved thus gives the excess energy. Since the gas is isothermal, 
it is only necessary to calculate the gravitational term in iJgas, for 
the thermal terms cancel when we take the difference. The result is 
E 

excess — (10/17) ln(10/27)GMtot/r-200 = 0.58GMtot/r2oo, 
where A/tot is the total mass of the halo. For the cluster displayed 
in Fig. |l5| (which has a virial radius of 1.46 Mpc), this gives an 
excess energy of 1.4 keV/particle. 

In reality the gas removed must be more extended than as- 
sumed above. Removing a uniform fraction of gas at each radius 
naturally leads to no excess energy. If we model the more general 
case by removing the gas in two component: a 'uniform' compo- 
nent, followed by all the gas inside a radius of fr 200, then we get 



So: 



/ln/GMto^ 
1 - / '■200 



(21) 



For example, if half of the gas removed in the uniform compo- 
nent, then / — 5/(27 — 5) = 5/22. This gives iSoxccss ~ 
0.44GAftot/?'200i or 1.0 keV/particle for the above cluster. In- 
creasing the uniform component to 3/4 of the gas removed, so 
that / = 2.5/19.5, gives Soxccss = 0.30GMtot/r2oo or 0.7 
keV/particle. 



In a separate paper (Wu, Fabian, & Nulsen 1999), we show 



that groups are even more strongly affected by heating than clus- 
ters, so that most of their gas is outside their virial radii. It does 
not seem possible for cooling alone to explain this phenomenon. 
Therefore, the above mechanism would have to be supplemented 
by heating in the conventional sense. 
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8.5 The bottom line 

Of the three main methods discussed, supernova heating appears 
only marginally acceptable based on current data, and requires a 
much higher heating efficiency than is commonly assumed. Pref- 
erential cooling also struggles to provide sufficient excess energy, 
and would not be able to explain our results for groups. Since it is 
possible for AGN to provide more than enough energy, this would 
be our preferred choice. However, the actual heating mechanism in 
this case is uncertain. It remains possible that all three mechanisms 
contribute to the excess energy. 



9 CONCLUSIONS 

We have constucted a self-consistent semi-analytic model which 
follows the excess energies resulting from supernova heating and 
radiative cooling, and modifies newly-collapsed gas halos accord- 
ingly. The gas profiles of virialized halos are selected from a 2- 
parameter family of polytropic gas profiles in NFW (1997) poten- 
tial wells. 

In the absence of non-gravitational heating or cooling, the gas 
halos of model clusters are approximately self-similar, in agree- 
ment with the results of hydrodynamic simulations. In particu- 
lar, their bulk properties follow self-similar scaling laws such as 
Lx oc T^. The model was then normalized by matching to the 
largest observed X-ray clusters, as these are least affected by non- 
gravitational heating. 

Four contrasting 'heating models' were used to investigate the 
excess energy required to match X-ray cluster data. Each heating 
model represented a different way of modifying gas profiles in the 
presence of heating. In addition we investigated the excess energy 
available from supernova heating in our model, and discussed ef- 
fects our model could not account for which may possibly con- 
tribute to the excess energy of gas halos. In the last section, we dis- 
cussed other approaches to obtaining the required excess energy, in- 
cluding a significantly modified model for supernova heating, heat- 
ing by AGN, and the removal of gas at low potentials. 

We summarize our main conclusions below: 

• The semi-analytic model is able to reproduce the observed 
Lx — T relation, temperature function, luminosity function and 
mass deposition rate function, provided the simulated X-ray clus- 
ters are given excess energies of ~ 1 keV/particle in order to break 
their self-similarity. 

• The excess energies required by each of the four heating mod- 
els to match the observed Lx — T relation lie in the range 1.8- 
3.0 keV/particle. By analysing a fiducial cluster with T ~ 2 
keV, we find that the minimum excess energy required is about 1 
keV/particle when all the available gas profiles are considered (the 
winning profile in this case is isentropic). We note that other au- 
thors^e^uire^similar^^^ 

1999). 

• If the process that produces the excess energy ejects gas in 
galactic winds at the escape velocity of the host halo (as assumed by 
our model), then the resulting excess energies in halos of all masses 
follow a distinct pattern. This is largely determined by the binding 
energies of halos and the halo merger tree. The excess energies 
are therefore not sensitive to parameters such as the efficiency of 
supernova heating, esN. 

• In this case, the resulting excess energies in clusters are only 
~ 0.1 keV/particle, an order of magnitude less than the required 
amount. 



• If the gas distribution is made more extended by a high level of 
energy injection before the cluster virialized, then a positive 'grav- 
itational contribution' to the excess energy is likely. This may help 
to ease the energy requirements and will need to be investigated 
with hydrodynamic simulations. 

• Of the approaches discussed in section ^ for obtaining the re- 
quired excess energy: more than enough energy is available from 
AGN, supernova heating is only marginally acceptable, and prefer- 
ential cooling struggles to provide sufficient excess energy. How- 
ever, it remains possible that all three mechanisms contribute to the 
excess energy of X-ray clusters. 

It seems likely that similar excess specific energies to th ose 
in clusters also occur in groups (Wu, Fabian, & Nulsen 1999), in 
which case a large fraction of the gas that belongs to groups would 
be outside their virial radii. This may explain their steeper Lx — T 
relation (see also Balogh et al. 1999). 

Future measurements of the gas density and temperature pro- 
files of groups and small clusters should clarify these issues, and 
place much stronger constraints on the excess energy in low- 
temperature clusters. 
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APPENDIX A: FORMULAE FOR MODELLING THE GAS 
PROCESSES 

The equations used to model the gas processes described in sec- 
tion ^ are given below, along with formulae for some observed 
quantities. Where required, we assume polytropi c ga s profiles in 



NFW potential wells, as derived in sections g and 4J We remind 
the reader that the gas pro files used in the formulae are notional, 
as explained in section 



2.2 



. To denote radius we use r and x inter- 
changeably, where r is the physical radius and x — r/rg. 



We always calculate the quantities Lx, M, emission- weighted 
temperature, and cooling flow power at a redshift of zero, thus any 
evolution in these quantities over the life of a halo is accounted for. 



AI The extent of cold gas, x^ 

When a new halo forms, the ratio between the cooling time and the 
free-fall time to the centre of the halo, r — tcooi/ta, determines 
whether gas is able to form a hot hydrostatic atmosphere. A hot gas 
halo forms when t > tq, where tq is a parameter of the model. If 
T < To then the gas remains cold in general (as virialization shocks 
would be radiative and any heating would be transitory). When r 
is greater or less than to everywhere, the above criteria are simple 
to apply. Otherwise, if r increases monotonically with radius, then 
there is a unique radius, x^i, where r = tq. Gas inside of x^i is 
then classified as cold and the remaining gas forms a hot gas halo. 
In all cases, our model requires there to be one radius Xd which lies 
in the range to c, such that gas inside x^ is classified as cold and 
that outside, hot. Since r(x) is comparable to to in a narrow range 
of halo masses (corresponding to normal galaxies), the variation of 
T with radius is of concern only for this mass scale. For this reason 
we will only discuss in detail gas profiles used in section |^ (i.e. 
those belonging to Models A and B). We first derive the general 
expression for t(x) before considering less well-behaved cases. 
The cooling time of gas is given by 

_ 3 pgkT/finiH 
2 nennA{l ) 

where pg, T, Ue and nn (the electron and hydrogen number den- 
sities respectively) are all functions of r. The three densities are 
simply proportional to each other. The cooling rate is given by 
rzeTZHA(r), wh ere A(T) is the cooling functi on. We use the cool- 
ing function of Bohringer & Hensler (1989 ), which depends on 
metallicity as well as temperature. We assume that the metallic- 
ity of every gas halo is constant with radius. A simple estimate of 
tg is obtained by computing the free-fall time for a test particle to 
reach the centre of a sphere of uniform density: 



ts = 



16Gpt, 



(A2) 



where ptot, the total density of the halo at the radius concerned, 
has been substituted for this density. (The formula given is a factor 
of -\/2 greater than that for a collapsing sphere of uniform density.) 
This method does not account for the increased ptot towards the 
centre of the halo, hence it is a slight overestimate. 
It follows that 



tc 



ts 




T 



1/2 



77200 2200 A(r)pg 



(A3) 



where we have used the expressions 77200 = Qp77iH/(fcT2oo) 
and a — 4,-KGpsr'i, the latter being the characteristic potential 
of the NFW profile. We assume a primordial composition of 0.9 
hydrogen to 0.1 helium by number, which gives p = 0.619 and 
Pg/ {nenn) = 1.707771^. Expanding ptot(a;) and Pg(a::), we obtain 




ap. 



1/2 



77200Pg,200 A{T)g{x) ' 



(A4) 



where we have defined 

g{x)^x''\l + x) 
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Figure Al. Plot of l/g{x) (which is roughlY_proportional to t{x)), using 
the same parameters and linestyles as in Fig. eI The solid curve is given by 
7 = 1 and r)200 = 10; the dotted curves are obtained by reducing r;200. ^s 
in Model B, and the dashed curves by increasing 7, as in Model A (see text 
for discussion). 



7 



-??200 



ln(l + x) ln(l + c) 



(A5) 



If < Xc{ < c, then the equation T{xcf) = tq is solved numeri- 
cally. 

To obtain Xd, the model follows an algorithm which first de- 
termines whether or not t{x) is 'well-behaved'. This is done by 
first approximating it as proportional to l/g{x). Although A(T) 
is a complicated function when considered over several decades of 
temperature, the amount that T can vary in a given halo is limited. 
The steepest temperature profile we use that may be of concern is 
given by 7 = 5/3 and 77200 ~ 10, which is used in Model A. Here 
the temperature rises by about a factor of 3.5 from r2oo to the cen- 
tre. In general, the temperature range in a halo is much smaller, so 
that the mean variation of A(T) in halos is not large. 



In Fie 



Al 



we illustrate the general behaviour ofl/g{x), using 
c = 5 and the same values as in Fig. ^ for 7 and 7^200 • The qualita- 
tive behaviour is the same for other values of c. For small enough 
X, l/g(x) always diverges. This is simply due to the divergence of 
the NFW density profile and as long as the minimum occurs at suf- 
ficiently small X, as with the solid curve (7=1 and 77200 = 10), it 
is ignored. As we decrease 77200 or increase 7, the minimum moves 
to larger radii and l/g{x) becomes a flatter function. Eventually, 
the minimum disappears and if 7 is large, 1/ g[x) becomes a steep 
decreasing function of x. 

Since r and its interpretation are approximate, we use an al- 
gorithm which is relatively simple. The criteria for whether l/g(x) 
is well-behaved is given by its slope at a:: = 0.5. When this slope 
is positive (the well-behaved case), l/g{x) is sure to have a min- 
imum inside x = 0.5. If r > ro at this minimum, then Xd = 0; 
if r(c) < ro, then x^f = c; otherwise, t(xc{) = ro is solved 
numerically. 

When this slope is negative instead, 1 / g{x) is either relatively 
flat or strongly decreasing at larger radii (see Fig. Al; note that the 



latter occurs in Model A but not Model B). In this case, if r (0.5) > 
tq {x = 0.5 being where the slope is measured), then Xd = 
and all the gas is considered hot. We have made the assumption 
that even if r < ro at larger radii, there is sufficient hot gas in 
the centre to provide a working surface on which infalling gas can 



shock to high temperatures, so that a hydrostatic atmosphere can 
still form (as discussed in section ^. If r(0.5) < tq, then a^cf = c 
and all the gas is considered cold. (For completeness, the algorithm 
actually allows for the situation, very rare in Models A and B, when 
r(c) > r(0.5) in a 'poorly-behaved' halo. In this case, it finds a 
numerical solution if ro lies between r(0.5) and r(c).) 

A2 The fraction, /unbind, of cold gas that forms stars 

Supernova feedback from star formation is assumed to eject the 
rest of the gas once there is sufficient energy to do so. The fraction, 
/unbind, of cold gas that forms stars is given by, 

Mgas(a; < a;cf) 



/^unbind ~ 



-esN4 X 10"" erg = Ms^^{x > Xct) \Eg 



MsN 

+ (1 - /unbind) Mgas (a; < X^f) \Ecold\ 



(A6) 



where Afgas is the total gas mass in the specified region and AfsN 
is the mass of stars formed per resulting Type II supern ova. For a 
standard IMF Msn = 80 Mq ([Thomas & Fabian 1990[). Since we 



boost supernova rates by a factor of 5, AfsN ~ 16 Mq in this pa- 
per. The energy released by one supernova into surrounding gas is 
esN 4 X 10^*^ erg. iSgas is defined in equation |5| and i?coid is defined 
as for -Egas except that the thermal term is set equal to zero. I-Bgas 
and I -Ecoid | are average quantities which estimate the energies per 
unit mass required to eject the hot and cold gas respectively. 

If the solution to /unbind in the above equation is greater than 
1, then the gas halo is not ejected. In this case all the cold gas is 
able to form stars and /unbind ~ 1- 



A3 The mass of BDM that forms from hot gas 

Whenever there is hot gas in a halo, some of it may be able to 
cool to form baryonic dark matter (BDM) before the next collapse. 
The cooling radius, rcooi, is obtained by solving numerically the 
equation: 



3 pgkT / fimu 



2 71e?XHA(T) 



Ai, 



(A7) 



where the Ihs is the cooling time and At is the time from virializa- 
tion to the next collapse or the present day, whichever is sooner. 

The mass of BDM formed is equal to the mass of gas inside 
'"cool minus the mass which has already formed stars, if any. Some- 
times no hot gas is able to cool in the given time, in which case no 
cooling flow operates. 

A4 The mass cooling rate, M 

The instantaneous mass cooling rate, M, is estimated using 



(r) 



dr 



drcooi(t) 



dt 



(A8) 



where A/g. 



is the gas mass inside a radius of r, t is the time 



since the virialization and At is as defined above. The cooling ra- 
dius as a function of t, r-cooi(i), is obtained by substituting t for At 
in equation 



dt 



By differentiating equation A7 with respect to rcooi, we obtain 
a d / rpg,20o 



3 pi 



(A9) 



drcool(t) 2 TleTlH 77200Pg,200''3A(r) dx \ T200pg 

where we have assumed that dA(r)/dr is small. Expanding the 
derivative gives 
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d (Tp, 



'g,200 



da; \^ T2ooPg 



7- 



7 



2 Pg,200 

— jyaoo 

Pg 



1 



ln(l + a;) 



.(AlO) 



for the conservation of momentum and 



M = 47r 



Since dAfgas(7')/dr = 47rpg(r)r^, it follows tliat 

7 x\pJp^,2a<i?k{T) 



Pe- 



dS _ „ dvi 



(B3) 



2 rierlH ''sPg,200 



Pi 



7 - 



ln(l+a;) 
^3 



for the conservation of energy. Here pg, T, P, 5* and v are, respec- 
.(All) tively, the density, temperature, pressure, specific entropy and ve- 
locity of the gas, (j) is the gravitational potential, and T the viscous 
stress tensor (with components Tij). The Lagrangian time deriva- 
tive is 



AS The cooling flow power 



The cooling flow power is the bolometric luminosity of the cooling 
flow region. It is given by 



5 MkT{r = Tc, 
2 



,ol) 



yumn 



(A 12) 



which uses the enthalpy, 5kT / {2pmii), to estimate the total en- 
ergy radiated per unit mass. It corresponds observationally to the 
bolometric luminosity inside the cooling radius. 



A6 The X-ray luminosity, Lx 

This is the sum of the cooling flow power and the bolometric lumi- 
nosity due to the gas outside rcooi : 



nenHA(r)47rr dr 



5 MkT{r 
2 



□ ol) 



pniH 



(A13) 



By the time of observation, the density profile of gas that be- 
longed to r < r-cooi differs substantially from that of the notional 
gas profile due to the effects of radiative cooling. Hence the cool- 
ing flow power is estimated separately. Although the changes due 
to cooling are also felt outside rcooi, because volume is a rapidly 
increasing function of the radius, the effect is only significant close 
to rcooi, so that we treat the atmosphere as unmodified from the 
notional gas profile outside rcooi- 



A7 The emission-weighted temperature 

This is the temperature that is implied whenever we refer to the 
temperature of a cluster as a whole (as in section We calculate 
the temperature as weighted by the luminosity outside rcooi- It is 
thus given by 



To, 



/J'"™ r(r)nenHA(T)47rr"dr 
J7'" ncnHA(r)47rr2dr 



(A14) 



APPENDIX B: GAS ENERGY EQUATION 

We derive below the equations that govern £gas and Cgas- £gas, 
defined in equation |l3| is the mean total specific energy of gas in 
a proto-halo. The definitions of proto-halo and proto-gas halo are 
given in section ^. We use egas (equation [l^ to follow the total 
specific energy of the gas at a position which moves with the gas. 
The gas equations may be written 

%+V-PgV = 0, 



dt 

for the conservation of mass, 
Pg^ =-VP + V-T-pgV,^, 



(Bl) 



(B2) 



d _ d 
di^ di~^^' 



(B4) 



The first term on the right in the energy equation is the viscous heat- 
ing rate. The second term, F, is the net additional heating rate per 
unit volume due to effects other than adiabatic and viscous heating. 
Such processes include supernova heating and radiative heat loss. 

The specific enthalpy is defined as H = e + PV, where e is 
the specific energy and V = 1/pg is the specific volume. Using 
the first law of thermodynamics, de — T dS — P dV, gives dH — 
TdS + V dP, so that 



dH dS 
Pg— = Pg^- 



dt 



dt 



dP 
'dt 



d 



3 



dP 

^r+ — +vVP,(B5) 

dt 



where we have used the energy equation ( p3| ) and expanded the La- 
grangian derivative. Using the momentum equation ( |b^ to replace 
VP in the last term gives, after some algebra. 



dH „ 
pg— = V 



(Tv) + r + 



dP 



d 1 2 



Pgv ■ V<^. 



dt ■ at 

Converting v ■ in the last term into time derivatives of 
rearranging, we get 



(B6) 
), and 



Pg 



H 



1 2 



dP 
'dt 



= Pg^+V- (Tv)+r. (B7) 



Using equation ^ and pgH — P — p^e, this can be rewritten as 



d_ 

dt 



Pg 



+ V- 



PgV + 



d(j) 



pg^ + v- (Tv) + r. 



(B8) 



Integrating this over a comoving volume V, we get 



d_ 

dt 



Pg 



dV - 



-Ps 



+ Tv 



■ dA 



+ 



(B9) 



where dA is a vector element of surface area. But, for any Q and 
comoving volume V, 



d_ 

dt 



>dV = 



d 

dl' 



QdV+ / Qv-dA, 



(BIO) 



av 



so that when the partial derivative on the Ihs of equation ^ is taken 
outside the integral, we get extra terms which cancel most of the 
surface terms, giving 



d_ 

dt 



Pg 



dV 



(-Pv + Tv) ■ dA + 



dV. 



(BID 
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In section 7.1 we assume a simplified scenario where no gas 



is 'removed' to form stars and BDM in the proto-halo. We also 
assume that the gas pressure and viscosity at the boundary of the 
proto-halo are negligible. If V is the volume occupied by gas in the 
proto-halo, then the surface integral above vanishes. Substituting 
3kT/{2j2mu) for e and using the definition of iSgas (equation p3|), 
we obtain the result 



1 



dt 



gas Jy 



( d<P 



+ r] dv. 



(B12) 



In sections to [7.4| , we follow the gas in the proto-halo in 
more detail, defining the proto-gas halo to include only gas that 
eventually belongs to the virialized gas halo. Thus the mass of the 
proto-gas halo is constant with time. The volume, V, that it oc- 
cupies is irregular at early times, containing 'pockets' of gas which 
are excluded from the proto-gas halo because they later convert into 
stars or BDM. Usin g gga s = (e + -|- tj)) and dm = pgasdV, 
we rewrite equation 



BU 



A J eg,,dm = J (-Pv+Tv).dA+^ (^p^^+p) dV,{m3) 

where J egas dm is the total energy of the proto-gas halo. The sur- 
face integral gives the rate at which the proto-gas halo does work 
on neighbouring gas. In section Q we investigate the pressure term 
only. 



where we have assumed that the integral scales as the inverse of 
the radius of the system, and that the turnaround radius is twice the 
virial radius. 

To obtain a rough estimate of the absolute value of -Bgas.G 
at tv, we assume that the kinetic term in equation [l^ is zero, 
and estimate the thermal term. The gravitational binding energy 
of the halo is equal to J (l/2)ptot<^d'l/. The virial theorem then 
implies that the thermal energy of the gas halo is approximately 
/gas(-l/2) J (l/2)ptot<^dV', where /gas = pg.c/ptot is a con- 
stant and possible boundary terms at r2oo have been ignored. 
Dividing by i\fgas gives the specific thermal energy of the gas: 
/(-l/4)pg,G<^dl//Mgas. Therefore, 



-pg,G(l>dV 



-3En 



(C6) 



as shown in Fig. ^ It follows that -Egas,G(ia) ~ — 2i5rr 



APPENDIX C: THE EVOLUTION OF Sgas g 



In this appendix, we obtain a simple expression for the variation of 
£'gas,G with time (where the subscript 'G' implies that the system is 
evolved without including non-gravitational processes), and obtain 
rough estimates of -Bgas.G at ta and tv (see Fig.|2^). 

If the gas and dark matter have the same distribution, then 
47rGpg,G ~ /gasV^</> for some constant /gas < 1. Now, by inte- 
grating by parts twice, we obtain Green's Theorem; 



(l>V'(j}dV ; 



■dA + 



)V>dT/, (CI) 



where (j> — d<j)/dt. If the integrals are made over all space and 
vanishes at infinity, then the surface integrals vanish. Since pg,G oc 
V^<j!>, it follows that 



dV. 



(C2) 



In order to substitute into equation B12, where the integration is 



made over the volume of the proto-halo only, we need to assume 
that pg,G ~ Plot = outside the proto-halo. The volume of inte- 
gration above c an the n be shrunk down to the proto-halo. Setting 
r = 0, equation pi3 gives 



AE,. 



s,G 



dt 

Therefore, 



Mgas dt 



2Pg,G. 



)dV. 



gas,G 



Mg, 



In section 7.1 



-Pg.G0 dl/ + constant. 



we define the quantity iSn 



\E, 



gas.G] 



(C3) 

(C4) 
.The 



above result thus implies that 



En 



-pg,G<f>dV 



-Pg,G0dV 



,(C5) 



